- Jan 2023
Seven Principles of Data Feminism
- Examine power
- Challenge power
- Rethink binaries and hierarchies
- Elevate emotion an embodiment
- Embrace pluralism
- Consider context
- Make labor visible
There are some interesting analogies to be drawn between the abolitionist movement in the 1800s and modern day movements like abolition of police and racial justice, etc.
Topic modeling - What would topic modeling look like for corpuses of commonplace books? Over time?
wrt article: Soni, Sandeep, Lauren F. Klein, and Jacob Eisenstein. “Abolitionist Networks: Modeling Language Change in Nineteenth-Century Activist Newspapers.” Journal of Cultural Analytics 6, no. 1 (January 18, 2021). https://doi.org/10.22148/001c.18841. - Brings to mind the difference in power and invisible labor between literate societies and oral societies. It's easier to erase oral cultures with the overwhelm available to literate cultures because the former are harder to see.
How to find unbiased datasets to study these?
aspirational abolitionism driven by African Americans in the 1800s over and above (basic) abolitionism
- data science
- emotional labor
- power frameworks
- defunding police
- Catherine D'Ignazio
- Lauren F. Klein
- invisible labor
- topic modeling
- orality vs. literacy
- Data Feminism
- intersectional feminism
- dodging the memory hole
- aspirational abolitionism
Looking at three broad ideas with examples of each to follow: - signals - patterns - pattern making, pattern breaking
Proceedings of the Old Bailey, 1674-1913
Jane Kent for witchcraft
250 years with ~200,000 trial transcripts
Can be viewed as: - storytelling, - history - information process of signals
All the best trials include the words "Covent Garden".
Example: 1163. Emma Smith and Corfe indictment for stealing.
19:45 Norbert Elias. The Civilizing Process. (book)
Prozhito: large-scale archive of Russian (and Soviet) diaries; 1900s - 2000s
How do people understand the act of diary-writing?
a convenient way to evaluate the self
a means to see, with reassuring clarity [...] the changes which you constantly suffer.
a kindly blankfaced old confidante
Diary entries in five categories - spirit - routine - literary - material form (talking about the diary itself) - interpersonal (people sharing diaries)
Are there specific periods in which these emerge or how do they fluctuate? How would these change between and over cultures?
The pattern of talking about diaries in this study are relatively stable over the century.
pre-print available of DeDeo's work here
Pattern making, pattern breaking
Individuals, institutions, and innovation in the debates of the French Revolution
- transcripts of debates in the constituent assembly
the idea of revolution through tedium and boredom is fascinating.
speeches broken into combinations of patterns using topic modeling
(what would this look like on commonplace book and zettelkasten corpora?)
emergent patterns from one speech to the next (information theory) question of novelty - hi novelty versus low novelty as predictors of leaders and followers
Robespierre bringing in novel ideas
How do you differentiate Robespierre versus a Muppet (like Animal)? What is the level of following after novelty?
Four parts (2x2 grid) - high novelty, high imitation (novelty with ideas that stick) - high novelty, low imitation (new ideas ignored) - low novelty, high imitation - low novelty, low imitation (discussion killers)
Could one analyze television scripts over time to determine the good/bad, when they'll "jump the shark"?
Hermeneutic circle In traditional humanities scholarship, the hermeneutic circle refers to the way in which we understand some part of a text in terms of our ideas about its overall structure and meaning -- but that we also, in a cyclic fashion, update our beliefs about the overall structure and meaning of a text in response to particular moments.
a common technique in natural language processing is to operationalize certain semantic concepts (e.g., "synonym") in terms of syntactic structure (two words that tend to occur nearby in a sentence are more likely to be synonyms, etc). This is what word2vec does.
Can I use some of these sorts of methods with respect to corpus linguistics over time to better identified calcified words or archaic phrases that stick with the language, but are heavily limited to narrower(ing) contexts?
Operationalization Turning ideas into something we can measure off a data set.
- hermeneutic circle
- information theory
- calcified words
- natural language processing
- digital humanities
- open questions
- historical linguistics
- archaic phrases
- updating beliefs
- Bayesian inference