Have a personal or library account? Click to login
The EyCon Dataset: A Visual Corpus of Early Conflict Photography Cover

The EyCon Dataset: A Visual Corpus of Early Conflict Photography

Open Access
|Jul 2024

Abstract

The EyCon dataset, comprising nearly 130,000 JPEG images and pages, documents armed conflicts from the 1890s to 1918, with a focus on extra-European contexts. The project team aggregated thousands of digitized images and metadata from various institutions, including previously inaccessible documents. To enhance metadata, the team conducted visual and multimodal similarity analyses, as well as human and animal detection. Captions were processed to extract named entities for XML-formatted descriptive metadata. Challenges in identifying and publishing graphic images due to automated tools’ limitations in detecting violence were addressed with human expertise for accurate classification. Available online and on Zenodo for download and reuse, the dataset confronts issues in computer vision for heritage photographs, such as degradation from fading, discoloration, scratches and noise, which impair algorithms reliant on visual features. The under-representation of early photographic cultures in datasets introduces bias in applying standard solutions to archival materials.

DOI: https://doi.org/10.5334/johd.213 | Journal eISSN: 2059-481X
Language: English
Submitted on: Apr 3, 2024
Accepted on: Jun 7, 2024
Published on: Jul 4, 2024
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2024 Marina Giardinetti, Daniel Foliard, Julien Schuh, Mohamed-Salim Aissi, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.