Please use D2L to turn in both the HTML output and your R Markdown file in.
Define data wrangling and discuss why it is an important element in the data analysis or data visualization process.
Implement one function from the dplyr package on a dataset we have used in class (such as http://math.montana.edu/ahoegh/teaching/stat408/datasets/SeattleHousing.csv or http://math.montana.edu/ahoegh/teaching/stat408/datasets/HousingSales.csv ) and describe what this procedure is doing.
Describe the difference between substr()
and strsplit()
.
Use one of these function to create a new variable for the hour an Uber ride began using http://math.montana.edu/ahoegh/teaching/stat408/datasets/UberMay2014.csv.
Then apply the count()
function from dplyr to compute the number of rides starting at each hour