Fine-Tuning South Tyrolean Dialect-to-Standard German ASR with AlpiLinK

Open Access | Jun 2026

Figures & Tables

Table 1

Current overview of ASR training data used in this study (v1.3).

| Source | Type | Hours | Speakers | Age group | Provenance |
|---|---|---|---|---|---|
| Learner textbook | scripted | 47 m | 3 | 20–49 | public |
| AlpiLinK | scripted | 4 h 47 m | 180 | 10–89 | public |
| In-house recordings | scripted, spontaneous | 3 h 42 m | 3 | 20–49 | in-house, contributed |
| Audiovisual archives | scripted, spontaneous | 1 h 53 m | 9 | 0–99 | contributed |
| Promotional videos | spontaneous | 1 h 03 m | 86 | 20–79 | public |
| TOTAL | | 13 h 16 m | | | |

Table 2

Training data evolution and ASR performance computed on the same held-out test set derived from version v1.3 of the training data. Model v1.2 yields the best overall performance.

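| Model | Training data: tokens | Training data: duration | Performance (v1.3): WER ↓ | Performance (v1.3): BLEU ↑ |
|---|---|---|---|---|
| baseline | n/a (no fine-tuning) | n/a (no fine-tuning) | 0.46 | 44.58 |
| v1.0 | 51,474 | 6 h 39 m | 0.37 | 0.52 |
| v1.1 | 75,203 | 9 h 07 m | 0.27 | 65.65 |
| v1.2 | 87,298 | 10 h 16 m | 0.24 | 69.13 |
| v1.3 | 88,810 | 10 h 26 m | 0.24 | 68.73 |
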
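
For readers unfamiliar with the WER column in Table 2, the following is a minimal sketch of how word error rate is conventionally defined: the word-level edit distance (substitutions, deletions, insertions) between a reference transcript and the ASR output, divided by the number of reference words. The function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration; the scoring tool and text normalization used in the study are not stated on this page and may differ.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Illustrative only: tokens are split on whitespace and compared
    case-sensitively, with no punctuation or casing normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words so far
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1      # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr

    return prev[-1] / len(ref)


# One word dropped out of four reference words.
print(word_error_rate("das ist ein Test", "das ist Test"))
```

A corpus-level score is usually obtained by summing edit counts and reference lengths over all test utterances before dividing, rather than by averaging per-utterance values.
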
DOI: https://doi.org/10.5334/johd.533 | Journal eISSN: 2059-481X
Language: English
Page range: 74 - 74
Submitted on: Mar 1, 2026
Accepted on: May 5, 2026
Published on: Jun 8, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Greta H. Franzini, Luca Ducceschi, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.