Have a personal or library account? Click to login
When Data Meet Tools: Using the Monitor Corpus for the Analysis of Language Development Cover

When Data Meet Tools: Using the Monitor Corpus for the Analysis of Language Development

Open Access
|Nov 2025

Abstract

The aim of this paper is to introduce an infrastructure developed within the HiČKoK project to enable full-fledged corpus-based diachronic research of Czech. The individual sections of the paper present the components of this infrastructure, which links well-balanced, representative and annotated data with tailor-made tools for diachronic research. The forthcoming monitor corpus, covering the entire period of written Czech, along with its composition and annotation strategies, is briefly introduced. In the following sections, the potential of the application and its four modules—simple query, comparison, time-based associations, and diachronic collocations—are demonstrated through mini case studies. Combining large-scale data (as representative as possible) with a tool that enhances standard corpus functionalities, enriches them with a diachronic perspective, and enables result visualization makes diachronic research on language change more accessible and comprehensive.

DOI: https://doi.org/10.2478/jazcas-2025-0014 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 157 - 166
Published on: Nov 27, 2025
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2025 Václav Cvrček, Martin Stluka, Klára Pivoňková, published by Slovak Academy of Sciences, Ľudovít Štúr Institute of Linguistics
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.