Have a personal or library account? Click to login
A Global Lexical Database (GLED) for Computational Historical Linguistics Cover

A Global Lexical Database (GLED) for Computational Historical Linguistics

By: Tiago Tresoldi  
Open Access
|Feb 2023

Abstract

This work presents a lexical database with cognate annotation and phonological alignment for over 6,500 documented language varieties. The database includes per-family and global phylogenetic resources and offers a pre-computed global tree for language variety distance from normalized trees obtained with Bayesian Markov Chain Monte Carlo (MCMC) inference. Lexical data is provided in a single tabular file for convenience of usage, and resources are built adhering to best practices and state-of-the-art algorithms for historical linguistics. The database is a convenient source for research prototypes, method development, and analysis bootstrap. All resources are freely available for download for all interested researchers.

DOI: https://doi.org/10.5334/johd.96 | Journal eISSN: 2059-481X
Language: English
Published on: Feb 2, 2023
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2023 Tiago Tresoldi, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.