Loretta C. Duckworth Scholars Studio

⠀

Menu
  • Scholars Studio Blog
    • Disciplinary Fields
      • Anthropology
      • Archaeology
      • Architecture
      • Art History
      • Business
      • Computer Science
      • Critical Digital Studies
      • Cultural Studies
      • Dance
      • Economics
      • Education
      • Environmental Studies
      • Film Studies
      • Gaming Studies
      • Geography
      • History
      • Information Science
      • Linguistics
      • Literary Studies
      • Marketing
      • Media and Communication Studies
      • Music Studies
      • Political Science
      • Psychology
      • Public Health
      • Sculpture
      • Sociology
      • Urban Studies
      • Visual Art
    • Digital Methods
      • coding
      • critical making
      • data visualization
      • digital pedagogy
      • immersive technology (AR/VR)
      • mapping
      • textual analysis
      • web scraping
  • About
    • Current Staff
    • Current Fellows
    • Faculty Fellowships
    • Graduate Extern Program
Menu

The myth of exhaustivity

Posted on September 9, 2015August 26, 2019 by Gerald Doyle

By Gerald Doyle

AutobioBibWorksExcluded

I came across the “Works Excluded” statement pictured above as I started using what is thought of as the definitive bibliography of pre-1945 US autobiography in existence to help build up my corpus of public domain early 20c immigrant narrative. The editor’s forthright statement of what he was not collecting struck me as provocative in two ways.

First, as a life writing scholar, it struck me that this definition of American autobiography leaves out huge swaths of what scholars have found most interesting and important about the field, such as “most episodic accounts, such as those relating to Indian captivities” and “works commonly recognized as fictional even when the factual element is strong.” Both captivity narratives and autobiographical hoaxes have been the source of field-defining work, especially relative to women’s life narrative. As well, for my purposes, these two exclusions indicate to me that I’m going to have to continue to look for bibliographic guidance to make sure my corpus ends up as complete as it can be at this time and that my broader research interest in the relationship between data and narrative lends itself to exactly the types of life narratives that are deemed non-autobiographical here.

Second, as a humanist integrating computationally-enabled methods, it struck me as a reminder that all archives face practical, as well as political, constraints that compromise their claim to exhaustive representation of the past. The editor is clear that this compromise has shaped the text he has produced; he has systematically excluded these works in order “to prevent this bibliography from growing so large that no press could afford to publish it.”  Of course, exhaustive representation perhaps less the claim of any archive than the perception of it, a perception that is only heightened in the age of digital data. Which is, if anything, even more painfully limited than print archives. Case in point: Google has digitized about 15 million of the 129 million books ever published, and about 5 million of those have scanned textual data good enough to use for something like n-grams. Because we get a cool picture drawn from data that is, admittedly, vaster than any individual eye or mind could ever assemble, we might like to think that an n-gram tells us something meaningful about “all” of literature or “all” of 19c newspapers. And it might, but we can’t assume that. We have to acknowledge how our scholarship is shaped by our limitations.

The point of this isn’t to say that the current archive of digitized, OCR’d, and publicly mine-able texts is good enough to stop worrying about it or conceptually equivalent to the coverage of the print archive.  Instead, this seemed to me a good reminder that textual analysts, of the digital and analog bents, should strive to be forthright about our limitations–well-versed in the constraints we face, how those shape the work we are able to do, and unapologetic about the value we see in that work providing despite of those limitations.

Share this:

  • Twitter
  • Facebook
  • Reddit
  • Email

Related Posts

Leave a Reply

Your email address will not be published. Required fields are marked *

This site uses Akismet to reduce spam. Learn how your comment data is processed.

Recent Posts

  • Digital Practices for the Study of Cultural Heritage (Part 2) April 7, 2022
  • Visualizing Changes in Colombian Wetlands with ArcGIS Story Maps March 21, 2022
  • Digital Practices for the Study of Cultural Heritage (Part 1) February 8, 2022
My Tweets

Tags

3D modeling 3D printing 360 video arduino augmented reality authorship attribution coding corpus building critical making Cultural Heritage data cleaning data visualization digital art history Digital Preservation digital reconstruction digital scholarship early modern film editing games gephi linked open data machine learning makerspace mapping network analysis oculus rift OpenRefine Photogrammetry physical computing Python QGIS R SketchUp stylometry terrain modeling text analysis text mining textual analysis top news twitter video analysis virtual reality visual analysis voyant web scraping

Archives

©2022 Loretta C. Duckworth Scholars Studio | Design: Newspaperly WordPress Theme
loading Cancel
Post was not sent - check your email addresses!
Email check failed, please try again
Sorry, your blog cannot share posts by email.