RStudio
Explore the airquality dataset by performing the following steps:
Calculate the number of variables, the size of the data, …
Is this Big Data?
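A minimal sketch of one way to approach this step, assuming the airquality data frame from base R's datasets package. The names n_rows and n_cols are illustrative, and the in-memory footprint is only one possible reading of "size of the data".

```r
# airquality ships with base R (package 'datasets'), so nothing is downloaded.
data(airquality)

n_rows <- nrow(airquality)   # number of observations
n_cols <- ncol(airquality)   # number of variables
dim(airquality)              # both at once: rows, then columns

str(airquality)              # variable names and storage types

# Approximate memory footprint of the data frame
print(object.size(airquality), units = "Kb")
```

A data frame of this size fits in memory many times over, which is one argument to weigh when deciding whether it counts as Big Data.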
Calculate summary statistics
Is there missing data? If so, which variables are most affected?
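The summary statistics and the missing-data check could be sketched as follows, using base R only. The names na_counts and n_complete are illustrative; sorting the counts is just one convenient way to see which variables are most affected.

```r
data(airquality)

# Min, quartiles, median, mean, max, and NA count for every column
summary(airquality)

# Missing values per variable, most affected first
na_counts <- sort(colSums(is.na(airquality)), decreasing = TRUE)
na_counts

# The same counts as a share of all rows
round(na_counts / nrow(airquality), 3)

# Number of rows with no missing value in any column
n_complete <- sum(complete.cases(airquality))
n_complete
```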
Visualize each variable individually
Are there outliers?
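One possible way to look at each variable on its own, sketched with base graphics. Treating Ozone, Solar.R, Wind, and Temp as continuous is an assumption of this sketch, as is the 1.5 * IQR whisker rule used to flag candidate outliers. Other outlier definitions would give different answers.

```r
data(airquality)

# Variables treated as continuous in this sketch (an assumption)
num_vars <- c("Ozone", "Solar.R", "Wind", "Temp")

op <- par(mfrow = c(2, 2))        # 2 x 2 grid of panels
for (v in num_vars) {
  hist(airquality[[v]], main = v, xlab = v)   # distribution shape
}
for (v in num_vars) {
  boxplot(airquality[[v]], main = v)          # points beyond whiskers
}
par(op)                           # restore previous graphics settings

# Month takes only a few distinct values, so a bar chart of counts suits it
barplot(table(airquality$Month), main = "Observations per month",
        xlab = "Month")

# Values flagged by the boxplot rule (beyond 1.5 * IQR from the quartiles)
outliers <- lapply(airquality[num_vars], function(x) boxplot.stats(x)$out)
outliers
```

A flagged value is only a candidate. Whether it is a measurement error or a genuine extreme still needs judgment.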
Should any variable(s) be converted to a factor? If so, which ones?
Convert these to a factor
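A sketch of the conversion, assuming Month is the variable chosen. Counting distinct values per column is one heuristic for spotting categorical candidates; whether Day should also be converted is a judgment call left open here. The copy named aq and the month labels are illustrative choices.

```r
data(airquality)

# Few distinct values hint at a categorical variable
sapply(airquality, function(x) length(unique(x)))

# Work on a copy (aq is an illustrative name) so the original stays intact
aq <- airquality
aq$Month <- factor(aq$Month, levels = 5:9,
                   labels = c("May", "Jun", "Jul", "Aug", "Sep"))

levels(aq$Month)   # the factor levels, in calendar order
table(aq$Month)    # observations per level
str(aq)            # confirm Month is now a factor
```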
Visualize any 5 pairs of variables of the airquality dataset
Choose the appropriate plot for each pair
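As an illustration of matching plot type to variable types, the sketch below draws five pairs. The choice of pairs is arbitrary, and Month is assumed to have been converted to a factor. Scatter plots are used for two continuous variables and grouped boxplots for a continuous variable against a factor.

```r
data(airquality)
aq <- airquality                  # illustrative copy
aq$Month <- factor(aq$Month)      # treat Month as categorical

# Continuous vs continuous: scatter plots
plot(Ozone ~ Temp,    data = aq, main = "Ozone vs Temp")
plot(Ozone ~ Wind,    data = aq, main = "Ozone vs Wind")
plot(Ozone ~ Solar.R, data = aq, main = "Ozone vs Solar.R")

# Continuous vs categorical: one boxplot per factor level
boxplot(Temp ~ Month,  data = aq, main = "Temp by Month")
boxplot(Ozone ~ Month, data = aq, main = "Ozone by Month")

# Optional overview of all continuous pairs, useful for picking which to study
pairs(aq[c("Ozone", "Solar.R", "Wind", "Temp")])
```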