Skip to content
R for the Rest of Us Logo

Introduction to the Dataset

This lesson is locked

Get access to all lessons in this course.

If the video is not playing correctly, you can watch it in a new window

Transcript

Click on the transcript to go to that point in the video. Please note that transcripts are auto generated and may contain minor inaccuracies.

Your Turn

You’ll be working with the college dataset to run all your analyses.

  1. Create a new project. Make sure you put it somewhere you’ll be able to find it again later!

  2. Download the dataset “college.csv” from https://bit.ly/college-dataset

  3. Create a new R script file or RMarkdown document where you’ll do all of your inferential statistics. Alternatively, download the blank exercises document.

  4. Import the spreadsheet into a dataframe college using readr::read_csv()

Learn More

To learn more about the summarytools package, check out its vignette.

The grade_class variable in the dataset is not a factor, but could become one. To learn about factors, check out Chapter 15 of R for Data Science.

Have any questions? Put them below and we will help you out!

You need to be signed-in to comment on this post. Login.

Gaurav Gupta

Gaurav Gupta

October 15, 2021

Hi- the first two links in the 'Learn More' section don't work or exist anymore. Thanks, Gaurav

Gaurav Gupta

Gaurav Gupta

October 17, 2021

Thanks David- this package (summarytools) has turned out to be one of the most powerful tools I have ever used in R. I have been struggling with using 'weights' for analysing survey data in R and recently came across the 'pollster' package. summarytools can do far more than pollster can and brings R almost at par with what STATA in terms of comprehensive summary stats by with or without incorporating weights.

Brenda Domeika Ponsot

Brenda Domeika Ponsot

July 15, 2022

Hello - library(summarytools) will not run for me. The error message: Error in library(summarytools) : there is no package called ‘summarytools’