Have a personal or library account? Click to login
Methods for Extracting Relational Data from Unstructured Texts Prior to Network Visualization in Humanities Research Cover

Methods for Extracting Relational Data from Unstructured Texts Prior to Network Visualization in Humanities Research

Open Access
|Nov 2020

Abstract

Network modelling methodologies in the digital humanities have been be used to advance inquiry in a variety of areas—most commonly those having to do with correspondence, citation, and social media networks. While new technologies have made generating high-quality and even dynamic network visualizations relatively easy, key challenges remain for humanities researchers. Many common objects of humanistic inquiry, such as literary, historiographic, and biographical texts are often not easily transformed into the kinds of data structures necessary for network visualization. The Transparency to Visibility (T2V) Project was initiated to develop new methods and toolkits that can support humanistic researchers who need to extract relationship data from unstructured texts to support network visualization. The T2V team used bioethics accountability statements to pilot and evaluate different methods for extracting relationship data. The resulting machine-learning-enhanced natural language processing (NLP) and metadata-assisted approaches offer promising potential pathways for contemporary digital humanities and future toolkit development.

 

Funding statement: Funding was provided by a National Endowment for the Humanities Digital Humanities Advancement grant (HAA-261070).

DOI: https://doi.org/10.5334/johd.21 | Journal eISSN: 2059-481X
Language: English
Published on: Nov 19, 2020
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2020 S. Scott Graham, Zoltan P. Majdik, Dave Clark, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.