The RISE Humanities Data Benchmark: A Framework for Evaluating Large Language Models for Humanities Tasks

Open Access | Feb 2026

Abstract

The RISE Humanities Data Benchmark is a framework and collection of curated datasets for evaluating large language models (LLMs) on humanities-related tasks. The datasets are designed to be small and task-specific and are each accompanied by manually verified ground truths. An accompanying tool systematically submits the datasets to various LLM providers and models using shared prompts and configurations, then automatically scores the results against the ground truths. The results are published and searchable through a web interface. The framework aims to promote greater reproducibility, transparency, and consistency in LLM-based data processing in the humanities.

DOI: https://doi.org/10.5334/johd.481 | Journal eISSN: 2059-481X
Language: English
Submitted on: Nov 15, 2025 | Accepted on: Jan 7, 2026 | Published on: Feb 4, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Maximilian Hindermann, Sorin Marti, Lea Katharina Kasper, Arno Bosse, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.