How Airbnb is making its data more reliable

Airbnb is overhauling the way data generated by users on its website is ingested into its Hive data warehouse to reduce outages and bring greater accuracy and reliability to its decision-making. Data infrastructure software engineer Krishna Puttaswamy told the 2016 Hadoop Summit last week that the company needed to improve the timeliness, completeness and quality of its data reserves. “Airbnb wants to offer the best travel experiences to our users and we believe data can play a critical role in offering those experiences,” Puttaswamy said. Airbnb primarily drives insights out of its data through the use of machine learning models, though it also surfaces data via a set of dashboards. “We have various products and various teams using machine learning models built on top of the data in the warehouse,”…


Link to Full Article: How Airbnb is making its data more reliable