Have a personal or library account? Click to login
Text Recognition for Nepalese Manuscripts in Pracalit Script Cover

Text Recognition for Nepalese Manuscripts in Pracalit Script

Open Access
|Nov 2022

Abstract

This dataset is a model for handwritten text recognition (HTR) of Sanskrit and Newar Nepalese manuscripts in Pracalit script. This paper introduces the state of the field in Newar literature, Newar manuscripts, and HTR engines. It explains our methodology for developing the requisite ground truth consisting of manuscript images and corresponding transcriptions, training our model with a PyLAia engine, and this model’s limitations. This dataset shared on Zenodo can be used by anyone working with manuscripts in Pracalit script, which will benefit the fields of Indology and Newar studies, as well as historical and linguistic analysis.

DOI: https://doi.org/10.5334/johd.90 | Journal eISSN: 2059-481X
Language: English
Published on: Nov 30, 2022
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2022 Alexander James O’Neill, Nathan Hill, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.