Big Data Benchmark Gauges Hadoop Platforms

In another indication of a maturing technology and growing demand, an industry group has released a big data analytics benchmark designed to gauge the performance of Hadoop-based systems. The Transaction Processing Performance Council (TPC) said this week that its TPCx-BB benchmark for big data analytics covers systems such as MapReduce, Apache Hive, Apache Spark and its Machine Learning Library, or MLlib. According to the TPC website, the “express” benchmark measures the performance of Hadoop-based systems, including hardware and software components. The benchmark executes 30 frequently performed analytical queries in the context of “retailers with physical and online store presence.” The queries are expressed in SQL for structured data and in machine learning algorithms for semi-structured and unstructured data. SQL queries can use Hive or Spark while machine learning algorithms use…

