Microsoft Open Sources Distributed Machine Learning Toolkit For Easier Big Data Research

The Microsoft Distributed Machine Learning Toolkit (DMTK) has been made open source by the company’s Asia research team. The DMTK is intended to make machine learning tasks on big data more scalable and efficient with a smaller cluster of computers. This is particularly useful for machine learning researchers and developers who work with large datasets. Distributed machine learning involves stringing a number of computers together to solve complex problems.

Here’s what the DMTK contains:

The DMTK framework, a parameter server that supports storing a hybrid data-structure model.

Two distributed machine learning algorithms, which, according to Microsoft, “can be used to train the fastest and largest topic model and largest word-embedded model in the world”.

APIs to reduce the barrier to entry for distributed machine learning “so researchers and developers can focus…
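For readers unfamiliar with the term, the parameter-server idea mentioned above can be illustrated with a toy sketch in Python. This is not DMTK's API: the class ToyParameterServer, its pull and push methods, and the train helper are hypothetical names invented for illustration, and threads stand in for the separate machines a real cluster would use. Workers each hold a shard of the data, pull the current parameters from a shared server, compute a gradient locally, and push the update back.

```python
import threading
from collections import defaultdict


class ToyParameterServer:
    """In-process stand-in for a parameter server (hypothetical, not DMTK's API)."""

    def __init__(self, learning_rate=0.01):
        self._params = defaultdict(float)  # shared model parameters, default 0.0
        self._lock = threading.Lock()      # serializes concurrent pulls and pushes
        self._lr = learning_rate

    def pull(self, keys):
        """Return a snapshot of the requested parameters."""
        with self._lock:
            return {k: self._params[k] for k in keys}

    def push(self, gradients):
        """Apply one gradient-descent step for each pushed gradient."""
        with self._lock:
            for key, grad in gradients.items():
                self._params[key] -= self._lr * grad


def worker(server, shard):
    """Fit y = w * x on this worker's shard using squared-error gradients."""
    for x, y in shard:
        w = server.pull(["w"])["w"]
        grad = 2.0 * (w * x - y) * x
        server.push({"w": grad})


def train(shards, epochs=50):
    """Run one thread per data shard for a number of epochs; return the learned w."""
    server = ToyParameterServer(learning_rate=0.01)
    for _ in range(epochs):
        threads = [threading.Thread(target=worker, args=(server, s)) for s in shards]
        for t in threads:
            t.start()
        for t in threads:
            t.join()
    return server.pull(["w"])["w"]


if __name__ == "__main__":
    # Two shards of points drawn from y = 3x, one per simulated worker.
    shards = [[(1.0, 3.0), (2.0, 6.0)], [(3.0, 9.0), (4.0, 12.0)]]
    print(train(shards))
```

The sketch keeps the model as a flat dictionary of floats for brevity. A production parameter server would shard parameters across machines and handle networking and fault tolerance, none of which is shown here.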

