Skip to content

R in 3 Months (Fall 2022)

Tidy Data

This lesson is called Tidy Data, part of the R in 3 Months (Fall 2022) course. This lesson is called Tidy Data, part of the R in 3 Months (Fall 2022) course.

Transcript

Click on the transcript to go to that point in the video. Please note that transcripts are auto generated and may contain minor inaccuracies.

Your Turn

Read the Tidy Data vignette
Take a look at your data and see which principles of tidy data it violates

Learn More

In the video, I only talk about two types of data tidying: each variable forming a column and each type of observational unit forming a table. If you want to see examples of the third type (each observation forming a row), check out the tidy data vignette from the tidyr package.

The workflow diagram I talked about is from Chapter 1 of R for Data Science.

Tidy data worfklow.

One small note unrelated to the main content of this lesson: I recorded it before dplyr 1.0 was released. If you have this version of dplyr installed, you have access to the across() function, which enables you to do summaries across rows. My example of finding it challenging to summarize German speakers data across rows would be much easier using the across() function. However, I still think that in most cases, it is easier to tidy your data and work with it in that format.

Have any questions? Put them below and we will help you out!

You need to be signed-in to comment on this post. Login.

Vuk Sekicki • April 19, 2021

Hello David,

Could you help me out understanding this: names_pattern = "(.)(.+)"

Specifically what is "(.)(.+)"

Thanks.

Matt M • November 9, 2021

I see you re-worded the 3 rules of tidy data from the vignette. Although I think I understand conceptually what is being sought, I'm not sure I follow what each rule means (i.e., what I need to do to make sure that I'm complying with the rule) and what a violation of each rule looks like (the third rule in particular)

Course Content

142 Lessons

Welcome to Getting Started with R

Install RStudio

Objects and Functions

Examine our Data

Import Our Data Again

R in 3 Months Fall 2022 - Introductions thread!

R in 3 Months Fall 2022 Week 1 Live Session

Welcome to Fundamentals of R

RMarkdown Overview

R in 3 Months Fall 2022 Week 2 Project Assignment

R in 3 Months Fall 2022 Week 2 Drop-in Session

R in 3 Months Fall 2022 Week 2 Live Session

Getting Started

Create a New Data Frame

R in 3 Months Fall 2022 Week 3 Project Assignment

R in 3 Months Fall 2022 Week 3 Drop-in Session

R in 3 Months Fall 2022 Week 3 Live Session

An Important Workflow Tip

The Grammar of Graphics

Text and Labels

R in 3 Months Fall 2022 Week 4 Project Assignment

R in 3 Months Fall 2022 Week 4 Drop-in Session

R in 3 Months Fall 2022 Week 4 Live Session

R in 3 Months Fall 2022 Week 5 Project Assignment

R in 3 Months Fall 2022 Week 5 Drop-in Session

Welcome, Logistics, Course Materials, and Additional Resources

What is Git? What is GitHub?

Why Should You Learn to Use Git and GitHub?

Update Everything

Create a Local Git Repository

GitHub Repositories

Connect RStudio and GitHub

Push an RStudio Project to a GitHub Repository

Pull a GitHub Repository to an RStudio Project

Keep RStudio and GitHub in Sync

R in 3 Months Fall 2022 Week 6 Project Assignment

R in 3 Months Fall 2022 Week 6 Drop-in Session

R in 3 Months Fall 2022 Week 6 Live Session

Dealing with Missing Data

Changing Variable Types

Advanced Variable Creation

Advanced Summarizing

Binding Data Frames

R in 3 Months Fall 2022 Week 7 Drop-in Session

R in 3 Months Fall 2022 Week 7 Live Session

Renaming Variables

Quick Interlude to Reorganize our Code

R in 3 Months Fall 2022 Week 8 Project Assignment

R in 3 Months Fall 2022 Week 8 Drop-in Session

R in 3 Months Fall 2022 Week 8 Live Session

Data Visualization Best Practices

Pipe Data Into ggplot

Reorder Plots to Highlight Findings

Use Color to Highlight Findings

Use the scales Package for Nicely Formatted Values

Use Direct Labeling

R in 3 Months Fall 2022 Week 9 Project Assignment

R in 3 Months Fall 2022 Week 9 Drop-in Session

R in 3 Months Fall 2022 Week 9 Live Session

R in 3 Months Fall 2022 Week 10 Drop-in Session

Use Axis Text Wisely

Use Titles to Highlight Findings

Use Color in Titles to Highlight Findings

Use Annotations to Explain

Customize Your Theme

Customize Your Fonts

Try New Plot Types

R in 3 Months Fall 2022 Week 11 Project Assignment

R in 3 Months Fall 2022 Week 11 Drop-in Session

R in 3 Months Fall 2022 Week 11 Live Session

Advanced Markdown Text Formatting

Making Your Reports Shine: Word Edition

Making Your Reports Shine: HTML Edition

Making Your Reports Shine: PDF Edition

R in 3 Months Fall 2022 Week 12 Drop-in Session

R in 3 Months Fall 2022 Week 12 Live Session

R in 3 Months Final Project Assignment

R in 3 Months Fall 2022 Week 13 Live Session

All R in 3 Months Fall 2022 Videos

Reading documentation pages

Working with file paths and RStudio Projects

Styling RMarkdown docs

Structuring large projects (and dealing with slow knitting of Rmd files)

Quarto vs RMarkdown

How to get lesson and lecture slides

{lubridate} for working with dates and times

Statistical Tests