Lesson 2 of 20
Introduction to the Dataset
You’ll be working with the college dataset to run all your analyses.
- Create a new project. Make sure you put it somewhere you’ll be able to find it again later!
- Download the dataset “college.csv” from https://bit.ly/college-dataset
- Create a new R script file or RMarkdown document where you’ll do all of your inferential statistics. Alternatively, download the blank exercises document.
- Import the spreadsheet into a dataframe
To learn more about the
summarytools package, check out its vignette.
grade_class variable in the dataset is not a factor, but could become one. To learn about factors, check out Chapter 15 of R for Data Science.
Have any questions? Put them below and we will help you out!
Hi- the first two links in the ‘Learn More’ section don’t work or exist anymore. Thanks, Gaurav
Thanks! I updated one deleted the other, which doesn’t appear to exist anymore.
Thanks David- this package (summarytools) has turned out to be one of the most powerful tools I have ever used in R. I have been struggling with using ‘weights’ for analysing survey data in R and recently came across the ‘pollster’ package. summarytools can do far more than pollster can and brings R almost at par with what STATA in terms of comprehensive summary stats by with or without incorporating weights.
Glad it’s so helpful!
Hello – library(summarytools) will not run for me. The error message: Error in library(summarytools) :
there is no package called ‘summarytools’
Have you installed the summarytools package? If not, you’ll need to do that first and then rerun your code.