Loretta C. Duckworth Scholars Studio

The myth of exhaustivity

Posted on September 9, 2015 (updated August 26, 2019) by Gerald Doyle

By Gerald Doyle

[Image: the “Works Excluded” statement from the autobiography bibliography]

I came across the “Works Excluded” statement pictured above when I started using what is widely regarded as the definitive bibliography of pre-1945 US autobiography to help build my corpus of public domain early 20c immigrant narrative. The editor’s forthright statement of what he was not collecting struck me as provocative in two ways.

First, as a life writing scholar, I noticed that this definition of American autobiography leaves out huge swaths of what scholars have found most interesting and important about the field, such as “most episodic accounts, such as those relating to Indian captivities” and “works commonly recognized as fictional even when the factual element is strong.” Both captivity narratives and autobiographical hoaxes have been the source of field-defining work, especially in relation to women’s life narrative. For my purposes, these two exclusions tell me two things: I will have to keep looking for bibliographic guidance to make sure my corpus ends up as complete as it can be at this time, and my broader research interest in the relationship between data and narrative lends itself to exactly the types of life narratives that are deemed non-autobiographical here.

Second, as a humanist integrating computationally-enabled methods, I took it as a reminder that all archives face practical, as well as political, constraints that compromise their claim to exhaustive representation of the past. The editor is clear that this compromise has shaped the text he has produced; he has systematically excluded these works in order “to prevent this bibliography from growing so large that no press could afford to publish it.” Of course, exhaustive representation is perhaps less the claim of any archive than the perception of it, a perception that is only heightened in the age of digital data, which is, if anything, even more painfully limited than print archives. Case in point: Google has digitized about 15 million of the 129 million books ever published, and about 5 million of those have scanned textual data good enough to use for something like n-grams. Because we get a cool picture drawn from data that is, admittedly, vaster than any individual eye or mind could ever assemble, we might like to think that an n-gram tells us something meaningful about “all” of literature or “all” of 19c newspapers. And it might, but we can’t assume that. We have to acknowledge how our scholarship is shaped by our limitations.
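
To put those figures in proportion, here is a quick back-of-the-envelope sketch in Python. The variable names and the `share` helper are my own illustration, and the counts are simply the rough estimates quoted above, not exact measurements.

```python
# Approximate figures quoted in the post, in millions of books.
PUBLISHED = 129   # books ever published
DIGITIZED = 15    # books Google has digitized
USABLE = 5        # digitized books with text good enough for n-grams


def share(part: float, whole: float) -> float:
    """Return part as a percentage of whole, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Hypothetical labels, chosen only to make the three ratios readable.
coverage = {
    "digitized / published": share(DIGITIZED, PUBLISHED),
    "usable / digitized": share(USABLE, DIGITIZED),
    "usable / published": share(USABLE, PUBLISHED),
}

for label, pct in coverage.items():
    print(f"{label}: {pct}%")
```

On these estimates, an n-gram draws on roughly 4 percent of everything ever published, which is worth keeping in mind before reading it as a statement about “all” of anything.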

The point of this isn’t to say that the current archive of digitized, OCR’d, and publicly mine-able texts is good enough to stop worrying about it or conceptually equivalent to the coverage of the print archive. Instead, this seemed to me a good reminder that textual analysts, of both the digital and analog bents, should strive to be forthright about our limitations: well-versed in the constraints we face and in how those shape the work we are able to do, and unapologetic about the value we see that work providing despite those limitations.
