Yahoo releases machine learning dataset for researchers

Yahoo has announced the public release of a machine learning dataset to the academic research community. With this release, the company aims to advance the field of large-scale machine learning and recommender systems, and to help level the playing field between industrial and academic research. Described as the largest ever, the Yahoo News Feed dataset is based on a sample of anonymised user interactions on the news feeds of several Yahoo properties, including the Yahoo homepage, Yahoo News, Yahoo Sports, Yahoo Finance, Yahoo Movies, and Yahoo Real Estate. The dataset contains 110 billion events (13.5 TB uncompressed) of user-news item interaction data, collected by recording the interactions of about 20 million users from February 2015 to May 2015. The dataset provides categorized demographic information (age range, gender, and…

