Yahoo Releases Free 13.5TB Data Set for Academia

The move should help academic researchers create better software that gives you the content you most want to see.

Yahoo has released a gigantic 13.5TB machine-learning data set to the academic research community, a release it hopes will encourage innovation (and perhaps bolster Yahoo’s own products, of course). The data set, which is entirely anonymized, contains around 110 billion interaction events from about 20 million Yahoo users, collected between February and May of last year. Each of these users interacted with the news feed on one of Yahoo’s major sites, which include its homepage, Yahoo News, Yahoo Sports, and Yahoo Finance. The data set also contains the age range, gender, and generalized geographic data for a subset of these 20 million users. “On the item side, we are releasing the title,…

