Have a personal or library account? Click to login
From Experiments to Epistemic Practice: The RISE Humanities Data Benchmark Cover

From Experiments to Epistemic Practice: The RISE Humanities Data Benchmark

Open Access
|Mar 2026

Abstract

The RISE Humanities Benchmark suite emerged from concrete research support practices undertaken by the Research and Infrastructure Support (RISE) unit for humanities and social science researchers at the University of Basel. At RISE, our digital-humanities consulting and infrastructure work frequently involves evaluating computational methods on historical and multilingual text and image data. Over time, these evaluations produced a body of tacit, methodological insights to a series of recurring questions. For example, which large language models (LLMs) handle historical handwriting reliably, which configurations balance cost and accuracy, and which types of visual layouts lead to systematic failures? The RISE Humanities Benchmark suite provides a means to transform this accumulated experience into a structured framework that can be used to reference, verify, and extend such observations. More broadly, the goal of the suite is to enable the wider humanities community to perform informed assessments of LLMs against their own data without specialized technical expertise. By publishing procedures, datasets, and metrics in a consistent, open format, the suite aims to lower the threshold for evidence-based decision-making in computational humanities projects by making the grounds for such decisions explicit and contestable.

DOI: https://doi.org/10.5334/johd.470 | Journal eISSN: 2059-481X
Language: English
Submitted on: Nov 15, 2025
|
Accepted on: Jan 21, 2026
|
Published on: Mar 2, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Maximilian Hindermann, Lea Katharina Kasper, Sorin Marti, Arno Bosse, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.