Introduction to the Dataset
This lesson is called Introduction to the Dataset, part of the Inferential Statistics with R course. This lesson is called Introduction to the Dataset, part of the Inferential Statistics with R course.
Transcript
Click on the transcript to go to that point in the video. Please note that transcripts are auto generated and may contain minor inaccuracies.
Your Turn
You’ll be working with the college dataset to run all your analyses.
Create a new project. Make sure you put it somewhere you’ll be able to find it again later!
Download the dataset “college.csv” from https://bit.ly/college-dataset
Create a new R script file or RMarkdown document where you’ll do all of your inferential statistics. Alternatively, download the blank exercises document.
Import the spreadsheet into a dataframe
college
usingreadr::read_csv()
Learn More
To learn more about the summarytools
package, check out its vignette.
The grade_class
variable in the dataset is not a factor, but could become one. To learn about factors, check out Chapter 15 of R for Data Science.
Have any questions? Put them below and we will help you out!
Course Content
21 Lessons
You need to be signed-in to comment on this post. Login.
Gaurav Gupta • October 15, 2021
Hi- the first two links in the 'Learn More' section don't work or exist anymore. Thanks, Gaurav
Gaurav Gupta • October 16, 2021
Thanks David- this package (summarytools) has turned out to be one of the most powerful tools I have ever used in R. I have been struggling with using 'weights' for analysing survey data in R and recently came across the 'pollster' package. summarytools can do far more than pollster can and brings R almost at par with what STATA in terms of comprehensive summary stats by with or without incorporating weights.
Brenda Domeika Ponsot • July 14, 2022
Hello - library(summarytools) will not run for me. The error message: Error in library(summarytools) : there is no package called ‘summarytools’