Today, we'll cover an essential component of working with R: how to import your data into R! We'll do so with functions from one of the core tidyverse packages: readr.
Today, we'll start with the R4DS chapter on "tidy" data, learning what it means for a dataframe to be in a tidy format, and how to reshape untidy into tidy data.
We'll continue with the R for Data Science book in the 5th season of Code Club, now switching to the brand-new second edition! We'll talk about pipes in R, which is covered in the short Chapter 5 of the book.
We continue with this chapter on Exploratory Data Analysis (EDA), now covering missing values (and the ifelse function to turn outliers into missing values) and covariation among variables, with a couple of new plot types: boxplots and heatmaps.
This chapter covers so-called Exploratory Data Analysis (EDA): computing summary stats and especially making quick plots to explore the variation in and distributions of single variables (this session), and looking at covariation among variables (next session).
Today we will continue to investigate the summarize() function. Together with group_by(), this function is extremely useful to produce summary statistics of your data by group.