Building Data Science Applications on Databricks

Try this notebook in DatabricksThis is part 2 of a 3 part series providing a gentle introduction to writing Apache Spark applications on Databricks. This post focuses on the tools and features that are helpful for data scientists to solve business problems instead of managing infrastructure. The big challenge for data scientists is to take a model from the prototyping all the way into production. This process is often littered with a variety of different environments, samples of data that do not reflect production data quality, and a litany of infrastructure challenges. These problems become even worse when multiple teams are restricted to sharing one cluster for all of their work. In these challenges lie the reason that, as a Solutions Architect, I’ve seen many data science teams become successful…


Link to Full Article: Building Data Science Applications on Databricks