Pentaho adds native Python integration

Aiming to better support machine learning and analytical environments, Pentaho Labs yesterday announced that it has developed a native integration for the Python language through Pentaho Data Integration (PDI). PDI is essentially a portable “data machine” for ETL, which you can deploy as a stand-alone Pentaho cluster or inside a Hadoop cluster through MapReduce or YARN. Will Gorman, vice president of Pentaho Labs at Hitachi subsidiary Pentaho, says the integration means data scientists can now use of the most popular and flexible open-source languages to increase productivity and data governance while supporting predictive analytics and machine learning. He says the integration will also make data science and predictive modeling more accessible to the developer community. [ Related: 5 legacy technologies still in high demand today] “Python is the environment that‚Ķ


Link to Full Article: Pentaho adds native Python integration