Financial Question-answering Dataset for Slovak Language Model Evaluation

Open Access | Nov 2025

Abstract

The limited availability of language resources for Slovak presents a significant challenge for the development and evaluation of language models. In this paper, we introduce a multiple-choice question-answering dataset specifically designed for the financial domain in Slovak. The dataset contains 1,334 questions, each with one correct answer and four incorrect ones. It is systematically organized by topic and difficulty level to facilitate structured evaluation. Using this dataset, we assess the performance of several Slovak generative language models and compare their results with those obtained on a general question-answering dataset to analyze domain-specific model capabilities. The best-performing model is a monolingual Slovak model. Furthermore, the observed performance differences between financial-domain and general question-answering tasks suggest that domain-specific language modeling requires further research.
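As a rough illustration of the structured evaluation the abstract describes, the sketch below scores multiple-choice predictions and breaks accuracy down by topic and by difficulty. It is not the authors' evaluation code: the `Question` fields, the topic and difficulty labels, and the sample items are hypothetical placeholders, and the only facts taken from the abstract are that each question has five options (one correct, four incorrect) and carries a topic and a difficulty level.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Question:
    # Hypothetical record layout; the dataset's real schema may differ.
    text: str
    options: list[str]  # five options: one correct, four incorrect
    answer: int         # index of the correct option
    topic: str
    difficulty: str


def accuracy_by_group(questions, predictions, key):
    """Return accuracy per group, where key(question) names the group."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for question, predicted in zip(questions, predictions):
        group = key(question)
        totals[group] += 1
        hits[group] += int(predicted == question.answer)
    return {group: hits[group] / totals[group] for group in totals}


# With five options per question, uniform random guessing scores 1/5.
RANDOM_BASELINE = 1 / 5

# Placeholder items and labels, for illustration only.
opts = ["a", "b", "c", "d", "e"]
questions = [
    Question("Q1", opts, 0, "banking", "easy"),
    Question("Q2", opts, 2, "banking", "hard"),
    Question("Q3", opts, 4, "insurance", "easy"),
    Question("Q4", opts, 1, "insurance", "easy"),
]
predictions = [0, 1, 4, 3]  # option indices chosen by some model

by_topic = accuracy_by_group(questions, predictions, lambda q: q.topic)
by_difficulty = accuracy_by_group(questions, predictions, lambda q: q.difficulty)
```

Comparing each group's accuracy with `RANDOM_BASELINE` shows whether a model does better than chance on that topic or difficulty level.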

DOI: https://doi.org/10.2478/jazcas-2025-0022 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 247 - 257
Published on: Nov 27, 2025
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2025 Daniel Hládek, Kristián Sopkovič, Ján Staš, Zuzana Sokolová, Matúš Pleva, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.