Have a personal or library account? Click to login

Using a parallel corpus to adapt the Flesch Reading Ease formula to Czech

Open Access
|Dec 2021

Abstract

Text readability metrics assess how much effort a reader must put into comprehending a given text. They are, e.g., used to choose appropriate readings for different student proficiency levels, or to make sure that crucial information is efficiently conveyed (e.g., in an emergency). Flesch Reading Ease is such a globally used formula that it is even integrated into the MS Word Processor. However, its constants are language-dependent. The original formula was created for English. So far it has been adapted to several European languages, Bangla, and Hindi. This paper describes the Czech adaptation, with the language-dependent constants optimized by a machine-learning algorithm working on parallel corpora of Czech and English, Russian, Italian, and French, respectively.

DOI: https://doi.org/10.2478/jazcas-2021-0044 | Journal eISSN: 1338-4287 | Journal ISSN: 0021-5597
Language: English
Page range: 477 - 487
Published on: Dec 30, 2021
Published by: Slovak Academy of Sciences, Mathematical Institute
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2021 Klára Bendová, published by Slovak Academy of Sciences, Mathematical Institute
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.