Providing Digital Provenance: from Modeling through Production

At last week’s useR! R User conference, I spoke on digital provenance, the importance of reproducible research, and how Domino has solved many of the challenges faced by data scientists when attempting this best practice. More on the topic, and a recording of the talk, below. What are you doing to ensure that you’re mitigating the many risks associated with provenance (or lack thereof)? Reproducibility is important throughout the entire data science process. As recent studies have shown, subconscious biases in the exploratory analysis phase of a project can have vast repercussions over final conclusions. The problems with managing the deployment and life-cycle of models in production are vast and varied, and often reproducibility stops at the level of the individual analyst. Though R has best in class support for…


Link to Full Article: Providing Digital Provenance: from Modeling through Production