It is often that said that those who work with data spend at least half of their time cleaning it. Beautiful visualizations and intricately designed reports often hide the hours and hours of work spent getting data into the right format for analysis. Data does not speak for itself — at least not without some serious cleaning beforehand.
Data cleaning skills are essential to your success. Fortunately, R has some great packages to help with data cleaning. Unfortunately, knowing what these packages are and how to use them is not straightforward.
This course will help you learn how to take messy data and quickly clean it. You’ll learn to:
Get your data into tidy format
Use regular expressions to deal with complex character data
Work with missing data
Identify and deal with duplicate values
And much, much more!
The 31 lessons in this course take you from the high-level thinking about best practices in data organization to the nitty gritty of cleaning messy data. You’ll learn both the why and the how of data cleaning. We can’t promise that you’ll have less data cleaning to do, but this course will make sure that the data cleaning you do is faster and more efficient than ever before.