Have a personal or library account? Click to login
Lemmatization of the DIA1900 Diachronic Corpus Cover
Open Access
|Dec 2023

Abstract

This paper focuses on the process of lemmatization of the upcoming Czech diachronic corpus of the second half of the 19th century, DIA1900. The article describes different approaches to the corpus lemmatization of synchronic written, spoken and diachronic corpora within the Czech National Corpus project, including single- and multilevel lemmatization and available tools used to link the variants.

DOI: https://doi.org/10.2478/jazcas-2023-0045 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 275 - 284
Published on: Dec 25, 2023
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2023 Lucie Benešová, Klára Pivoňková, Martin Stluka, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.