Skip to content

Loretta C. Duckworth Scholars Studio

⠀

Menu
  • Scholars Studio Blog
    • Digital Methods
      • coding
      • critical making
      • data visualization
      • digital pedagogy
      • immersive technology (AR/VR)
      • mapping
      • textual analysis
      • web scraping
    • Disciplinary Fields
      • Anthropology
      • Archaeology
      • Architecture
      • Art History
      • Business
      • Computer Science
      • Critical Digital Studies
      • Cultural Studies
      • Dance
      • Economics
      • Education
      • Environmental Studies
      • Film Studies
      • Gaming Studies
      • Geography
      • History
      • Information Science
      • Linguistics
      • Literary Studies
      • Marketing
      • Media and Communication Studies
      • Music Studies
      • Political Science
      • Psychology
      • Public Health
      • Sculpture
      • Sociology
      • Urban Studies
      • Visual Art
    • Cultural Analytics Practicum Blogposts
  • Current Staff
  • Newsletter
  • About
    • Games Group 
Menu
Towers of cards fill the floor at Gen Con with several of them spelling out GEN CON.

Reflections on TUScholarShare’s First Dataset: Gen Con Programs

Posted on December 3, 2025December 3, 2025 by Matt Shoemaker

By Will Dean and Matt Shoemaker


In 2020, Temple Libraries launched the university’s first institutional repository, TUScholarShare, with an integrated research data collection and deposit service. The first deposit to the collection was a dataset collected by Matt Shoemaker, Head of the Loretta C. Duckworth Scholars Studio. Five years is a long time in the academic world – around how long an undergraduate degree often takes – and the last few have contained more than their fair share of events so we wanted to celebrate this milestone with a look at our first dataset. 

A Wizard stands beneath the compass rose log for Gen Con 13

The original “Gen Con Programs” dataset contained records from all events held at the Gen Con gaming convention from 1968 to 2017 (and a recent update brings the dataset up to 2025).  Gen Con is the largest, longest running, and one of the oldest analog game conventions in the world.  Today, the 4-day event hosts more than 20,000 gaming events each year for more than 70,000 attendees, and this dataset allows researchers to see how analog gaming has changed since 1968.  What games were most popular, how they were described, how many people could play them, and more is all contained within this dataset.   

The event data for Gen Con was difficult for researchers to access due to its ephemeral nature.  From 1968 to 2002 it was only available within physical printed programs, many of which are quite scarce due to many people simply throwing them away once the convention was over for the year.  The data after 2002 was captured by downloading a CSV dump of the convention’s online event catalog. This had to be done in a timely manner as it, too, is lost to the ether shortly after the convention ends. For the physical programs, staff in the Loretta C. Duckworth Scholars studio scanned each program and trained ABBY FineReader to use OCR to extract and format the event data.  The spreadsheets then underwent minimal cleaning to make sure their columns matched across the years before being compiled and submitted for deposit to TUScholarShare. 

A crowd of people make their way between the booths that make up one aisle of the exhibit hall at Gen Con.

The Libraries’ Research Data Services (RDS) team, which oversees data deposits to TUScholarShare, used the dataset to test our data curation workflow that was adapted from the Data Curation Network. The process involved closely examining spreadsheets within the dataset, creating new descriptive information (or metadata) to facilitate search and retrieval, and frequently communicating with the depositor. It also presented an opportunity to use some tools that were new to RDS in order to preserve this dataset openly.  

As an open access repository, TUScholarShare is committed to making its content openly available and reusable, including its file types. For ease of use, the deposit is available as an XLSX file that opens in Excel and displays multiple sheets via tabs. While the XLSX file type is ubiquitous at the moment thanks to Microsoft’s domination of the office software market, it is not an open format that anyone can freely use. The comma separated value, or CSV, filetype is the most widely used open format for textual data and we used the Excel Archival Tool, openly available under the GNU GPLv3 license, to convert the many spreadsheet tabs quickly and easily into individual CSV files. Both file versions are available in the deposit, allowing us to make it available as openly as possible. 

Over the past five years, our data deposit workflow has been refined to be clearer and more efficient, and the breadth of deposit types has grown to include materials from nine of Temple’s schools and colleges. Updating our first deposit, in collaboration with Matt, with more recent data has allowed us to reflect on how far we have come and demonstrates how data deposits are meant to be reused and updated as we learn more about the world around us through research. Check out the updated Gen Con Programs dataset and consider contributing your own work to the Research Data collection via TUScholarShare’s data deposit form. 

Recent Posts

  • Reflections on TUScholarShare’s First Dataset: Gen Con Programs December 3, 2025
  • Critical Making as a Bridge Between Technology and Empathy November 17, 2025
  • Hosting Wax Worms: 3D Modeling Ecologies in Artificial Habitats November 4, 2025

Tags

3D modeling 3D printing arduino augmented reality authorship attribution coding corpus building critical making Cultural Heritage data cleaning data visualization Digital Preservation digital reconstruction digital scholarship games gen con gephi linked open data machine learning makerspace makerspace residency mapping network analysis oculus rift omeka OpenRefine Photogrammetry physical computing Python QGIS R SketchUp stylometry text analysis text mining textual analysis top news twitter video analysis virtual reality visual analysis voyant web scraping webscraping YouTube

Recent Posts

  • Reflections on TUScholarShare’s First Dataset: Gen Con Programs December 3, 2025
  • Critical Making as a Bridge Between Technology and Empathy November 17, 2025
  • Hosting Wax Worms: 3D Modeling Ecologies in Artificial Habitats November 4, 2025

Archives

Meta

  • Log in
  • Entries feed
  • Comments feed
  • WordPress.org

Archives

Blog Tags

3D modeling (11) 3D printing (14) arduino (8) augmented reality (5) authorship attribution (3) coding (12) corpus building (4) critical making (8) Cultural Heritage (10) data cleaning (4) data visualization (11) Digital Preservation (3) digital reconstruction (9) digital scholarship (12) games (6) gen con (3) gephi (3) linked open data (4) machine learning (6) makerspace (7) makerspace residency (4) mapping (30) network analysis (17) oculus rift (8) omeka (3) OpenRefine (4) Photogrammetry (5) physical computing (3) Python (8) QGIS (10) R (9) SketchUp (4) stylometry (8) text analysis (11) text mining (4) textual analysis (32) top news (102) twitter (5) video analysis (4) virtual reality (17) visual analysis (5) voyant (4) web scraping (16) webscraping (4) YouTube (3)

Recent Posts

  • Reflections on TUScholarShare’s First Dataset: Gen Con Programs December 3, 2025
  • Critical Making as a Bridge Between Technology and Empathy November 17, 2025
  • Hosting Wax Worms: 3D Modeling Ecologies in Artificial Habitats November 4, 2025
  • Asexuality in TV and Film: Visualizing the Invisible Orientation in Online Spaces October 28, 2025
  • Spring 2026 Advanced Digital Tools Faculty Learning Community Applications Now Open! October 27, 2025

Archives

©2025 Loretta C. Duckworth Scholars Studio | Design: Newspaperly WordPress Theme