Announcing Enhanced Apache Spark Support

Apache Spark has captured the hearts and minds of data professionals. A technology originally developed at Berkeley’s AMP lab, Spark provides a series of tools which span the vast challenges of the entire data ecosystem. Spark offers support for ingestion of real-time data via streaming, for large-scale distributed ETL, and even for analysis and modeling with MLLib and the newly added data frames API. At Domino, we feel that modern data science teams are fundamentally polyglot ecosystems, where many different tools with different philosophical and architectural approaches play an important role. While Domino has long had the ability to run code which triggered Spark jobs and connected to SparkSQL data sources, we’re proud to announce significantly enhanced Spark support: Broad deployment and configuration support, with local mode and stand-alone cluster…


Link to Full Article: Announcing Enhanced Apache Spark Support