Automating your data clean up

By Matt Yancey

Elevator Pitch

Data scientist’s knows that building a model is only 20% of their time, but cleaning the data is 80%. Yet many of the automation tools focus on the modeling (e.g., DataRobot). I’ll demo some tools and tips for automating the process of data profiling and cleaning.

Description

I’ll talk about the following: * The importance of data profiling * Common mistakes * Useful tools * Tips for getting the most out of it