Get access to all lessons in this course.
-
Welcome to Data Cleaning with R
- What is Data Cleaning?
- Course Logistics and Materials
-
Data Organization
- Data Organization Best Practices
- Tidy Data
- Grouping and Indicator Variables
- NA and Empty Values
- Data Sharing Best Practices
-
Restructuring Data
- Tidyverse Refresher
- Working with Columns with across()
- Pivoting Data
- coalesce() and fill()
-
Regular Expressions
- What are Regular Expressions?
- Understanding and Testing Regular Expressions
- Literal Characters and Metacharacters
- Metacharacters: Quantifiers
- Metacharacters: Alternation, Special Sequences, and Escapes
- Combining Metacharacters
- Regex in R
- Regular Expressions and Data Cleaning, Part 1
- Regular Expressions and Data Cleaning, Part 2
-
Common Issues
- Common Issues in Data Cleaning
- Unusable Variable Names
- Whitespace
- Letter Case
- Missing, Implicit, or Misplaced Grouping Variables
- Compound Values
- Duplicated Values
- Broken Values
- Empty Rows and Columns
- Parsing Numbers
- Putting Everything Together
Data Cleaning with R
Regular Expressions and Data Cleaning, Part 2
This lesson is locked
This lesson is called Regular Expressions and Data Cleaning, Part 2, part of the Data Cleaning with R course. This lesson is called Regular Expressions and Data Cleaning, Part 2, part of the Data Cleaning with R course.
If the video is not playing correctly, you can watch it in a new window
Transcript
Click on the transcript to go to that point in the video. Please note that transcripts are auto generated and may contain minor inaccuracies.
Your Turn
Download CRAN package descriptions using the
tools
packageSelect package name, author, description, and all variables that end in ‘ports’
Filter rows for packages with names that:
end in plot
contain Bayes
contain digits
are all UPPER CASE
Learn More
To learn more about groups and back references, check out the website regular-expressions.info and Chapter 1 of the book Mastering Software Development in R.
Two (paid) books that are also useful are Regular Expressions Cookbook and Mastering Regular Expressions.
You need to be signed-in to comment on this post. Login.
Alberto Cabrera
January 18, 2024
Luis Verde mentioned in this video he was going to list additional references on regexp at the end of the section on Data Cleaning. I could not find it.
David Keyes Founder
January 19, 2024
Can you clarify (ideally with a time stamp?) where he mentioned this?