Have a personal or library account? Click to login
Running Automatic Transcription for the Most Challenging Sources: Tabular Parish Registers from Transylvania Cover

Running Automatic Transcription for the Most Challenging Sources: Tabular Parish Registers from Transylvania

Open Access
|Feb 2026

Abstract

The paper explores the challenges and adaptations required for applying Transkribus to the parish registers from Transylvania, used as main sources for the HPDT and the complexity introduced by linguistic, alphabetic, and structural diversity inherent to the sources. The denominational and ethnic mosaic of Transylvania has resulted in parish registers written in a variety of languages—Latin, Hungarian, German, and Romanian— and using diverse alphabets, including Latin, Cyrillic, and Kurrentschrift. In many instances, these multilingual registers coexist within records from the same village, further complicating the recognition process. The structure of these tables, which can vary significantly, adds another layer of complexity.

The discussion is focused on adapting and developing the Transkribus platform to accurately recognize and convert the handwritten text into a format compatible with database structure. The adaptation process involves fine-tuning recognition models for each specific language, alphabet, and tabular configuration.

Language: English
Page range: 185 - 201
Published on: Feb 5, 2026
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Luminița Dumănescu, Angela Lumezeanu, Nicoleta Hegedüs, published by Lucian Blaga University of Sibiu
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.