Ryan Womack, Data Librarian
Rutgers University - New Brunswick Libraries
https://ryanwomack.com
https://mastodon.social/@ryandata/
Screencast version of a workshop at Rutgers University.
This series covers working with a large dataset in R and applying basic machine learning methods on the Amarel HPC cluster at Rutgers.
Part 5 covers the use of the caret package for train/test split, cross-validation, and modeling.
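As a rough illustration of the three steps named above, here is a minimal sketch using caret. It is not the workshop's code: the built-in mtcars dataset, the linear model, the 80/20 split, and the 5-fold setting are all assumptions chosen to keep the example small. It also assumes the caret package is already installed.

```r
# Minimal caret sketch: train/test split, cross-validation, modeling.
# Assumptions (not from the workshop): mtcars data, a linear model,
# an 80/20 split, and 5-fold cross-validation.
library(caret)

set.seed(42)  # make the split and the resampling reproducible

# 1. Train/test split: hold out roughly 20% of rows, balanced on the outcome
in_train  <- createDataPartition(mtcars$mpg, p = 0.8, list = FALSE)
train_set <- mtcars[in_train, ]
test_set  <- mtcars[-in_train, ]

# 2. Cross-validation: 5-fold CV applied within the training set only
ctrl <- trainControl(method = "cv", number = 5)

# 3. Modeling: fit a linear regression, evaluated by the CV scheme above
fit <- train(mpg ~ wt + hp, data = train_set,
             method = "lm", trControl = ctrl)

# Check performance on the held-out test rows (RMSE, R-squared, MAE)
pred    <- predict(fit, newdata = test_set)
metrics <- postResample(pred = pred, obs = test_set$mpg)
print(fit$results)
print(metrics)
```

Swapping the `method` argument in `train()` is the usual way to try a different model with the same split and resampling setup, though some methods require additional packages.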
[Sorry, the audio breaks up a bit at the end of this video. Please continue on to Part 6.]
Related R materials:
https://libguide.rutgers.edu/data_R
Code:
https://github.com/ryandata/AmarelR