… Textual Corpora and Computational Text Analysis Project
Taught by Sarah Connell and Elizabeth Maddock Dillon in the course Literature and Digital Diversity at Northeastern University. Full assignment details here.
Assignment
Develop a research question
Build or find a corpus
Prepare the corpus (select specific texts or portions of texts, remove metadata, etc.)
Train and query at least one model
Write up results in a research blog post
Discuss at least one scholarly source in the post
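The corpus preparation step above can be sketched in code. This is a minimal illustration only, not the course's workflow: the workshops use R and RStudio, while this sketch is in Python. The header marker "*** START ***", the function name prepare_text, and the sample text are all hypothetical. It drops a metadata header, lowercases the body, and splits it into word tokens, which is the kind of cleanup usually done before training a word embedding model.

```python
import re


def prepare_text(raw: str, header_end_marker: str = "*** START ***") -> list[str]:
    """Strip a metadata header (if present) and return lowercase word tokens.

    The marker string is a made-up convention for this sketch; a real corpus
    will mark its metadata differently (or not at all).
    """
    # Split at the first occurrence of the marker; `found` is empty if absent.
    _, found, body = raw.partition(header_end_marker)
    if not found:
        # No marker in this file: treat the whole text as the body.
        body = raw
    # Lowercase, then keep runs of letters, allowing internal apostrophes.
    return re.findall(r"[a-z]+(?:'[a-z]+)*", body.lower())


# Hypothetical sample document with two metadata lines above the marker.
sample = (
    "Title: A Narrative\n"
    "Author: Unknown\n"
    "*** START ***\n"
    "The ship's crew sailed at dawn."
)
tokens = prepare_text(sample)
print(tokens)
```

Which cleanup steps are appropriate depends on the research question: lowercasing and dropping punctuation, for example, merge word forms that some projects may want to keep distinct.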
Scaffolding
Previous assignment: a text analysis blog post project using web-based tools to compare versions of a historical narrative
In-class workshops on: R and RStudio, running word2vec code, building and preparing corpora, developing research questions, developing queries, and writing research blog posts