Regular Expressions and Data Cleaning, Part 2

Your Turn

  1. Download CRAN package descriptions using the tools package

  2. Select package name, author, description, and all variables that end in ‘ports’

  3. Filter rows for packages with names that:

  • end in plot

  • contain Bayes

  • contain digits

  • are all UPPER CASE
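One way to approach the steps above is sketched below in base R. It is not the lesson's own solution. The `pkgs` data frame is a made-up stand-in so the code runs offline; in a live session, `tools::CRAN_package_db()` would supply the real table. The helper `name_matches()` is a hypothetical name introduced for this sketch, and "all UPPER CASE" is read here as "letters only, all of them capitals", which is an assumption about what the exercise intends.

```r
# Toy stand-in for the CRAN package table (placeholder authors/descriptions).
# In an online session, step 1 would be:
#   pkgs <- tools::CRAN_package_db()
pkgs <- data.frame(
  Package     = c("corrplot", "BayesFactor", "R2jags", "MASS", "dplyr"),
  Author      = c("Author A", "Author B", "Author C", "Author D", "Author E"),
  Description = c("toy text 1", "toy text 2", "toy text 3",
                  "toy text 4", "toy text 5"),
  Imports     = c(NA, "coda", "rjags", NA, "rlang"),
  Version     = c("0.1", "0.2", "0.3", "0.4", "0.5"),
  stringsAsFactors = FALSE
)

# Step 2: keep name, author, description, and every column ending in "ports".
keep <- c("Package", "Author", "Description",
          grep("ports$", names(pkgs), value = TRUE))
pkgs_small <- pkgs[, keep, drop = FALSE]

# Step 3: filter rows by a regular expression applied to the package name.
# name_matches() is a hypothetical helper, not part of any package.
name_matches <- function(pattern) {
  pkgs_small[grepl(pattern, pkgs_small$Package), , drop = FALSE]
}

ends_in_plot   <- name_matches("plot$")           # "$" anchors to the end
contains_bayes <- name_matches("Bayes")           # literal, case-sensitive
contains_digit <- name_matches("[[:digit:]]")     # any digit anywhere
all_upper      <- name_matches("^[[:upper:]]+$")  # only capitals, start to end
```

With dplyr and stringr loaded, the same steps could be written with `select()`, `ends_with("ports")`, `filter()`, and `str_detect()`; the regular expressions themselves would not change.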

Learn More

To learn more about groups and back references, check out the website and Chapter 1 of the book Mastering Software Development in R.
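As a quick reminder of what groups and back references look like in practice, here is a minimal base R sketch. The input strings are made up for illustration.

```r
# Parentheses create groups; \\1 and \\2 refer back to what each group matched.

# In a replacement: swap two words by reusing the captured groups.
swapped <- sub("(\\w+) (\\w+)", "\\2 \\1", "hello world")

# In a pattern: "(.)\\1" matches any character followed by itself,
# so it flags strings that contain a doubled character.
has_double <- grepl("(.)\\1", c("ggplot2", "dplyr"))
```

Here `swapped` is `"world hello"`, and `has_double` is `TRUE` for `"ggplot2"` (the doubled "g") and `FALSE` for `"dplyr"`.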

Two (paid) books that are also useful are Regular Expressions Cookbook and Mastering Regular Expressions.

Have any questions? Put them below and we will help you out!

Alberto Cabrera

January 18, 2024

Luis Verde mentioned in this video that he was going to list additional references on regular expressions at the end of the section on Data Cleaning. I could not find them.

David Keyes Founder

January 19, 2024

Can you clarify where he mentioned this, ideally with a timestamp?