Due Date: 30 November 2022 (start of class)
Data is Plural is a weekly newsletter that describes and links to a set o “useful/curious datasets” that have been posted online. Each newsletter gives descriptions and links to around 4-6 datasets. In the list at the bottom of this page, each student has been assigned one of these newsletters.
For the final project, you will need to select one of the datasets from your assigned newsletter. I often use these as sources for data (particularly the exams); if you happen to see a dataset that we used in class, please do not choose it. If you find none of the datasets usable for the project, please let me know and I will assign you an alternative.
With your chosen dataset, you need to do three things: (1) complete the Data Feminism adapted version of a datasheet and (2) make an attempt to read the dataset into R and do something with the data, and (3) prepare one or two slides to present in class. You need to print out a copy of items (1) and (2) to bring to the last day of class. We’ll explore each other’s projects using a poster-session style setup with regards to (3), which you will share on your computer.
For the datasheet, you need to produce a document with the following eight sections. The requirements are slightly reduced from the notes in class in order to be respectful of your time. It is possible that you won’t have all of the information available to complete all of these sections. That’s okay; just explain that in your document.
You should aim for your datasheet to be between two and four pages long.
The second element of the project is try to read the data (or some subset of it) into R and do something with it. This will usually be the creation of 2-3 plots. These should include a full set of labels and a few sentences of context. You can do this as a knit RMarkdown file.
I hope that most students will be able to read some of their data into R. If you are having trouble, please ask for help as soon as possible. In the event that it is truly not possible, in lieu of this analysis, write up a description of the challenges you had and what would need to be fixed. This should only be done after consultation with the instructor.
Finally, for the presentation day, you should prepare slides that you will display on your computer. The slide should include:
It’s probably easiest to make these in PowerPoint or Google Slides. To give you an example of what this should look like, check on this one from the that I prepared from the exam data: example slides. If you have more plots that you’d like to show, you are welcome to include additional slide(s).
I assign grades to the final project holistically, based primarily on the effort you put into the project. To earn an A, you need only to have made an honest attempt at completing all of the elements as best as possible. The best way to do this is to start early so that you can come to the project workshops with meaningful questions.
Your assigned newsletter can be found by consulting the following chart:
Section 1 | Section 2 | Link |
---|---|---|
Elliot | Mark | [Week of 2022-04-20] |
Jack B. | Will | [Week of 2022-04-27] |
Mike | Yasmin | [Week of 2022-05-04] |
Jaden | Shira | [Week of 2022-05-11] |
Ditrick | Arju | [Week of 2022-05-18] |
Bailey | Ginny | [Week of 2022-05-25] |
Josh | Jordann | [Week of 2022-06-01] |
Travis | Sydney | [Week of 2022-06-08] |
Andrew | Yingzheng | [Week of 2022-06-15] |
Mads | Lu | [Week of 2022-06-22] |
Jack D. | Riley | [Week of 2022-06-29] |
Pengxu | Aris | [Week of 2022-07-06] |
Grace | Kyle | [Week of 2022-07-13] |
Mamnuya | Akhil | [Week of 2022-07-20] |
Justin | Becca | [Week of 2022-07-27] |
Tolya | Yixuan | [Week of 2022-08-03] |
Charles | Juneseo | [Week of 2022-08-10] |
Katherine | Sophie | [Week of 2022-08-17] |
Joel | Jiayi | [Week of 2022-08-24] |
Jack Z. | Yanran | [Week of 2022-08-31] |
Kamryn | Leyao | [Week of 2022-09-07] |
Elmer | Angela | [Week of 2022-09-14] |
Kathryn | Leah | [Week of 2022-09-21] |
Yibing | Thomas | [Week of 2022-09-28] |
Yueyi | [Week of 2022-10-05] |