{"id":9873,"date":"2026-03-27T15:29:04","date_gmt":"2026-03-27T19:29:04","guid":{"rendered":"https:\/\/sites.temple.edu\/tudsc\/?p=9873"},"modified":"2026-03-31T11:41:13","modified_gmt":"2026-03-31T15:41:13","slug":"building-a-metadata-for-asexual-representation-in-media","status":"publish","type":"post","link":"https:\/\/sites.temple.edu\/tudsc\/2026\/03\/27\/building-a-metadata-for-asexual-representation-in-media\/","title":{"rendered":"Building a Metadata for Asexual Representation in Media"},"content":{"rendered":"\n<p>By Ye Ju Ki<\/p>\n\n\n\n<!--more-->\n\n\n\n<h2 class=\"wp-block-heading\">From Screen, Forum, to Spreadsheet: Building the Metadata<\/h2>\n\n\n\n<p>In the recent Netflix series, <em>Sex Education, <\/em>Florence Simmons says that she \u201cmight be broken\u201d because she doesn\u2019t want to have sex at all. In response, Dr. Milburn introduces asexuality and validates how Florence is feeling: \u201cIt\u2019s when someone has no sexual attraction to any sex or gender. Sex just doesn\u2019t do it for some people. [\u2026] Some asexual people still want romantic relationships, but they don\u2019t want the sex bit. And others don\u2019t want either. You know, sexuality is fluid. Sex doesn\u2019t make us whole. And so, how could you ever be broken?\u201d<\/p>\n\n\n\n<p>In terms of openness to discussing asexuality on television, this conversation in a 2020 TV show has come a long way since Gregory House from <em>Dr. House<\/em> in 2012 called people who don\u2019t want to have sex \u201csick, dead, or lying\u201d, after an asexual patient was shown to have a tumor growing in his brain, lowering his libido and causing erectile dysfunction.<\/p>\n\n\n\n<p>This recent shift from discussing asexuality as a pathological mystery to offering asexuality as a valid sexual orientation represented a victory for the asexual community, and yet, it is only a partial one. According to the <em>Where We are on TV<\/em> reports published by GLAAD, there have been less than ten asexual characters represented on &nbsp;cable networks and streaming services between 2017 and 2025. To name a few, they are Raphael from <em>Shadowhunters<\/em>, Todd Chavez from <em>BoJack Horseman<\/em>, Drea from <em>Everything\u2019s Gonna Be Okay<\/em>, Greta from <em>genera+ion<\/em>, Elijah from <em>Big Mouth<\/em>, Ca$h from <em>Hearbreak High<\/em>, Issac from <em>Heartstopper<\/em>, and Florence Simmons and O from <em>Sex Education<\/em>.<\/p>\n\n\n\n<p>Inspired by the scarcity of asexual characters readily available in contemporary media, I am gathering explicit and implicit asexual characters scattered across popular media. In my<a href=\"https:\/\/sites.temple.edu\/tudsc\/2025\/10\/28\/asexuality-in-tv-and-film-visualizing-the-invisible-orientation-in-online-spaces\/\"> previous post<\/a>, I explained the general scope and goal of my Scholars Studio fellowship project. In this post, I explain the workflow of compiling a structured list of asexual characters in film and television, in order to build a dataset that could be &nbsp;visualized on &nbsp;an interactive website. Here are the steps I took to create metadata for building an online archive:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Step 1<\/strong>: Scraping data using <a href=\"https:\/\/webscraper.io\/\"><\/a><a href=\"https:\/\/webscraper.io\/\">Web Scraper<\/a> from various lists and websites, including <a href=\"https:\/\/www.asexuality.org\/en\/forum\/11-world-watch\/\"><\/a><a href=\"https:\/\/www.asexuality.org\/en\/forum\/11-world-watch\/\">AVEN World Watch Archive<\/a>, <a href=\"https:\/\/en.wikipedia.org\/wiki\/List_of_fictional_asexual_characters\"><\/a><a href=\"https:\/\/en.wikipedia.org\/wiki\/List_of_fictional_asexual_characters\">Wikipedia<\/a>, <a href=\"https:\/\/lgbtqia-characters.fandom.com\/wiki\/Category:Ace_Characters\"><\/a><a href=\"https:\/\/lgbtqia-characters.fandom.com\/wiki\/Category:Ace_Characters\">LGBTQIA+ Characters Wikia<\/a>, and <a href=\"https:\/\/lezwatchtv.com\/sexuality\/asexual\/\"><\/a><a href=\"https:\/\/lezwatchtv.com\/sexuality\/asexual\/\">LezWatch<\/a>, for gathering discussions online<\/li>\n\n\n\n<li><strong>Step 2<\/strong>: Cleaning the scraped data using OpenRefine and coding the data using ATLAS.ti<\/li>\n\n\n\n<li><strong>Step 3<\/strong>: Creating a dataset description and filling in the spreadsheet, for organizing the data into a structured list<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Scraping Data from AVEN<\/h2>\n\n\n\n<p>I started with the Web Scraper to bring together lists of and discussions on asexual characters in media. The Web Scraper does not require any coding, and it is available as a Chrome extension. While I scraped available lists on Wikipedia, LGBTQIA+ Wikia, and LezWatch using the Web Scraper, I mainly referred to AVEN\u2019s World Watch forum as the main part of the metadata, as it contained the most lively and extensive discussions on media portrayals of asexuality. For my research &nbsp;project, I informed and obtained permission from the AVEN Project Team, in compliance with the community\u2019s guidelines and their rules for using AVEN for research.<\/p>\n\n\n\n<p>Instead of relying on the Web Scraper\u2019s automated AI Wizard, I used the Advanced Sitemap Builder, so I could build a more detailed sitemap for navigating the AVEN forum with over 12,000 posts. While I had to familiarize myself with elements like \u201cwrapper\u201d and \u201cparent-child\u201d to build a sitemap, the Web Scraper was an easy and straightforward tool for scraping organized data from websites. I first analyzed how the forum was structured between posts, threads, and pages, so I could map out a path for the scraper to follow. I then defined and designated different elements I wanted to scrape in each post on a sitemap. Once the data was scraped, I exported them as CSV files.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"277\" src=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap-1024x277.png\" alt=\"\" class=\"wp-image-9884\" srcset=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap-1024x277.png 1024w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap-300x81.png 300w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap-768x208.png 768w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap-1536x415.png 1536w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap-2048x554.png 2048w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap-850x230.png 850w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"264\" src=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap_2-1024x264.png\" alt=\"\" class=\"wp-image-9885\" srcset=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap_2-1024x264.png 1024w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap_2-300x77.png 300w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap_2-768x198.png 768w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap_2-1536x395.png 1536w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap_2-2048x527.png 2048w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Web-Scraper-Sitemap_2-850x219.png 850w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 1. Screenshots of the sitemap used for scraping the AVEN World Watch Archive forum <\/em><\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">Cleaning and Coding the Scraped Data<\/h2>\n\n\n\n<p>Once the scraped data was exported as CSV files, I cleaned them using OpenRefine to get rid of unnecessary elements, columns, and texts from the CSV file. For instance, I removed the pagination column and unnecessary spaces in each cell, using the \u201ctext transform\u201d function in OpenRefine. This was crucial for making the following steps easier, considering the amount of data in the spreadsheets.&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"516\" src=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Open-Refine-1024x516.png\" alt=\"\" class=\"wp-image-9886\" srcset=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Open-Refine-1024x516.png 1024w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Open-Refine-300x151.png 300w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Open-Refine-768x387.png 768w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Open-Refine-1536x774.png 1536w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Open-Refine-2048x1032.png 2048w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/Open-Refine-850x428.png 850w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 2. A screenshot of OpenRefine<\/em><\/figcaption><\/figure>\n\n\n\n<p>After the scraped data was cleaned, I exported them as CSV files, so I could code each post. While it was quite a tedious step to manually code each post, this was a necessary step for categorizing the asexual characters, turning posts into a structured list, and possible data visualization. I also needed a point of reference and to be able to locate the exact post each character was mentioned in, in case I needed to refer back to the post for more details.<\/p>\n\n\n\n<p>I started with importing the cleaned data into ATLAS.ti as survey data, so that each post in its respective row would be read as a survey response. Then, I proceeded to go through each post and code them manually based on the type of media (television, film, audio drama, book, comics, game, manga, YouTube, and research article) and asexuality status (confirmed and speculated).<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1011\" height=\"486\" src=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/260225-Atlas-Survey-Import-Setting_AVEN-Archive-2021-2025.png\" alt=\"\" class=\"wp-image-9887\" srcset=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/260225-Atlas-Survey-Import-Setting_AVEN-Archive-2021-2025.png 1011w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/260225-Atlas-Survey-Import-Setting_AVEN-Archive-2021-2025-300x144.png 300w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/260225-Atlas-Survey-Import-Setting_AVEN-Archive-2021-2025-768x369.png 768w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/260225-Atlas-Survey-Import-Setting_AVEN-Archive-2021-2025-850x409.png 850w\" sizes=\"auto, (max-width: 1011px) 100vw, 1011px\" \/><\/figure>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"397\" src=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/QDA-Coding-1024x397.png\" alt=\"\" class=\"wp-image-9888\" srcset=\"https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/QDA-Coding-1024x397.png 1024w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/QDA-Coding-300x116.png 300w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/QDA-Coding-768x298.png 768w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/QDA-Coding-1536x596.png 1536w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/QDA-Coding-2048x795.png 2048w, https:\/\/sites.temple.edu\/tudsc\/files\/2026\/03\/QDA-Coding-850x330.png 850w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><em>Figure 3. Screenshots of importing the data as survey data and coding them on ATLAS.ti <\/em><\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\">Metadata Profile and Completing the Metadata<\/h2>\n\n\n\n<p>During the process of coding, I also developed a detailed metadata profile to determine which data would be meaningful for creating an archive dedicated to asexual characters in media. The goal was to move beyond simply listing basic information about each asexual character like character names, release years, and genres.<\/p>\n\n\n\n<p>The metadata profile included factual data, interpretative data, and character data. Factual data refers to objective data about each character, Interpretative data refers to data on how asexuality is represented through the character, and character data refers to data related to each character\u2019s asexuality.<\/p>\n\n\n\n<p>Based on the metadata profile and the dataset descriptions, I started filling out the cells in the spreadsheet. This was a manual process of looking up each character online, watching relevant television shows and films, and referring to respective Wikipedia and IMDb pages.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">From Metadata to Website: Building an Online Archive<\/h2>\n\n\n\n<p>While I am still coding and completing the metadata, I have started working on the final step: designing and creating a website that would function as an online archive that is crowdsourced and accessible to the public. Utilizing both Python and AI-assisted coding through Claude Code, I am building a website that uses the Hugo framework to host and visualize the list of asexual characters in film and television.<\/p>\n\n\n\n<p>The website is still in its infancy, but once completed and launched, it will provide a valuable resource for people on the asexual spectrum by making the dataset publicly available and readily accessible. Moreover, it will serve as an online space for visualizing the invisible orientation and fostering a community. For now, the dataset only includes asexual characters from film and television, but I hope to expand it to include asexual characters found in other types of media, such as video games and literature.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>By Ye Ju Ki<\/p>\n","protected":false},"author":35048,"featured_media":9895,"comment_status":"closed","ping_status":"","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[405,2,287],"tags":[525,524,401,6,408],"class_list":["post-9873","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-digital-humanities","category-grad-students","category-media-studies","tag-datasets","tag-digital-archives","tag-metadata","tag-top-news","tag-webscraping"],"_links":{"self":[{"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/posts\/9873","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/users\/35048"}],"replies":[{"embeddable":true,"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/comments?post=9873"}],"version-history":[{"count":7,"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/posts\/9873\/revisions"}],"predecessor-version":[{"id":9950,"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/posts\/9873\/revisions\/9950"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/media\/9895"}],"wp:attachment":[{"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/media?parent=9873"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/categories?post=9873"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sites.temple.edu\/tudsc\/wp-json\/wp\/v2\/tags?post=9873"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}