Have a personal or library account? Click to login
A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers Cover

A Dataset for Toponym Resolution in Nineteenth-Century English Newspapers

Open Access
|Jan 2022

Abstract

We present a new dataset for the task of toponym resolution in digitized historical newspapers in English. It consists of 343 annotated articles from newspapers based in four different locations in England (Manchester, Ashton-under-Lyne, Poole and Dorchester), published between 1780 and 1870. The articles have been manually annotated with mentions of places, which are linked—whenever possible—to their corresponding entry on Wikipedia. The dataset consists of 3,364 annotated toponyms, of which 2,784 have been provided with a link to Wikipedia. The dataset is published in the British Library shared research repository, and is especially of interest to researchers working on improving semantic access to historical newspaper content.

DOI: https://doi.org/10.5334/johd.56 | Journal eISSN: 2059-481X
Language: English
Published on: Jan 24, 2022
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2022 Mariona Coll Ardanuy, David Beavan, Kaspar Beelen, Kasra Hosseini, Jon Lawrence, Katherine McDonough, Federico Nanni, Daniel van Strien, Daniel C. S. Wilson, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.