Spark man Zaharia on 2.0 and why it’s ‘not that important’ to upstage MapReduce

Interview Spark is the open source cluster computing system started in 2009 by Matei Zaharia, when he was but an ‘umble PhD candidate at Berkeley’s AMPlab. Some people hope it will become the logical successor to MapReduce. Donated to the Apache Software Foundation in 2013, Spark has been backed by IBM. Proponents of Spark say it is superior to MapReduce, Google’s data processing framework, which formed the basis for Hadoop – another cluster system and competitor in the Big Data support and services sector. Companies such as Cloudera have announced initiatives allowing Spark to become the successor to Hadoop, and Spark is now supported by practically every business in the sector. Ahead of February’s Spark Summit in New York City, Zaharia told The Register that merely “replacing MapReduce is not…


Link to Full Article: Spark man Zaharia on 2.0 and why it’s ‘not that important’ to upstage MapReduce