What is Data Cleaning?
This lesson is called What is Data Cleaning?, part of the Data Cleaning with R course. This lesson is called What is Data Cleaning?, part of the Data Cleaning with R course.
Transcript
Click on the transcript to go to that point in the video. Please note that transcripts are auto generated and may contain minor inaccuracies.
Your Turn
Reflect on the amount (if any) of data cleaning you perform for your day-to-day work.
Learn More
Randy Au's article, Data Cleaning is Analysis, Not Grunt Work, makes a similar point.
The tweet thread below (click here for the original) argues that data cleaning is part of the analysis process.
Data scientists often complain that the bulk of their work is data cleaning.
— Data Science Fact (@DataSciFact) January 12, 2021
But if you see data cleaning as the work, not just an obstacle to the work, it can be interesting.
You could think of it as data pathology, a kind of analysis before the intended analysis.
Have any questions? Put them below and we will help you out!
Course Content
32 Lessons
You need to be signed-in to comment on this post. Login.