Course Notes and Assignments

Spring 2021
Tuesdays & Thursdays, 11h00 > 12h15 and 14h30 > 15h45

Instructor: Taylor Arnold
E-mail: tarnold2@richmond.edu
Method of Instruction: ONLINE
Office Hours: Tuesdays and Thursdays, Following Class; 19h-20h, Evening before projects due
Syllabus: Syllabus
Groups: Groups
Review: Using R to Manipulate and Visualize Data

Zoom Meeting ID: 958 4808 3217 (password in email)
RStudio: Class Workspace

Core Material — Building Predictive Models

DateTopicLinks
2021-01-19 Course Introduction and Setup [Introduction]
[Survey]
[R and RStudio Setup]
[RStudio Cloud Video]
[course code]
2021-01-21 01. The Language of Predictive Models [Notebook 01]
[Lab 01 (solutions)]
[Handout 01 (optional)]
2021-01-26 02. Creating the Model Matrix [Notebook 02]
[Lab 02 (solutions)]
[Handout 02 (optional)]
2021-01-28 03. Classification and Logistic Regression [Notebook 03]
[Lab 03 (solutions)]
[Handout 03 (optional)]
2021-02-02 04. Penalized Regression: Lasso and Elastic Net [Notebook 04]
[Lab 04 (solutions)]
[Handout 04 (optional)]
2021-02-04 05. Cross-Validation and Multinomial Regression [Notebook 05]
[Lab 05 (solutions)]
2021-02-09 No Class (University-Wide Day Off)
2021-02-11 06. Text Prediction [Notebook 06]
[Lab 06 (solutions)]
2021-02-16 07. Natural Language Processing [Notebook 07]
[Lab 07 (solutions)]
2021-02-18 08. Text Analysis Pipeline [Notebook 08]

Project 01 — IMDb (Reviews)

DateTopicLinks
2021-02-23 Project 01 Workshop
2021-02-25 09. More Model Features [Notebook 09]
2021-03-02 Project 01 Workshop (light week)
2021-03-04 Project 01 Workshop (light week)
2021-03-09 Project 01 Due — Presentations [Project 01]

Project 02 — Amazon Product Reviews (Authorship)

DateTopicLinks
2021-03-11 10. Mid-Course Thoughts [Notebook 10]
2021-03-16 11. Local Models: KNN [Notebook 11]
2021-03-18 12. Local Models: GBM [Notebook 12]
2021-03-23 Project 02 Due — Presentations [Project 02]

Project 03 — Yelp Reviews (Authorship/Clustering)

DateTopicLinks
2021-03-25 13. Unsupervised Learning [Notebook 13]
2021-03-30 Project 03 Workshop
2021-04-01 14. Project Three Ideas / Examples (No Class) [Notebook 14]
2021-04-06 15. Linear Regression vs. PCA [Notebook 15]
2021-04-08 Project 03 Due — Presentations [Project 03]

Project 04 — Wikipedia (Topics)

DateTopicLinks
2021-04-13 16. Latent Dirchlet Allocation (LDA) [Notebook 16]
2021-04-15 Project 04 Workshop
2021-04-20 Project 04 Workshop [Class Evaluation]
2021-04-22 Project 04 Due — Presentations [Project 04]

Self Assessment

DateTopicLinks
2021-04-27 Self Assessments Due [Self-Assessment Instructions]