Yahoo opens up 13.5TB machine learning dataset for academic research

Yahoo is unloading what it is boasting to be the largest-ever machine learning dataset made available publicly for the academic research community. Suju Rajan, director of research at Yahoo Labs, elaborated in prepared remarks that the search company is publishing the dataset with the goal of encouraging innovation — but especially in regards to how data from machine learning technologies can be turned around and used for new purposes. “Many academic researchers and data scientists don’t have access to truly large-scale datasets because it is traditionally a privilege reserved for large companies,” Rajan remarked. Dubbed the Yahoo News Feed dataset, the collection is actually just a sample set of anonymized user interactions from approximately 20 million users tuning into a variety of Yahoo properties, including Yahoo Finance, Sports, Movies, Real…


Link to Full Article: Yahoo opens up 13.5TB machine learning dataset for academic research