Skip to main content
Have a personal or library account? Click to login
Extending CLDF — Towards a Type System for Cross-Linguistic Data Cover

Extending CLDF — Towards a Type System for Cross-Linguistic Data

Open Access
|Apr 2026

Figures & Tables

Figure 1

Words for “mother” sampled across different languages in Pallas (1789: 10).

Figure 2

Pulmonic consonants of the variety Chevroux in the TPPSR.

Figure 3

The Phlorest project exemplifies the data curation workflow described here: Once a data type is identified, dedicated curation workflows can be supported with tools such as Python packages. Publication of datasets can then be modeled as “release” in GitHub’s standard workflow and the integration with Zenodo makes sure such releases are actually pushed to a longterm archive for scientific content.

Figure 4

Complex data in etymological dictionaries can be constructed from simpler data types which are already standardized in CLDF. In addition to the tabular data known from cognate coded wordlists the reconstruction tree adds direction and order to the reconstructed protoforms.

DOI: https://doi.org/10.5334/johd.517 | Journal eISSN: 2059-481X
Language: English
Page range: 62 - 62
Submitted on: Jan 30, 2026
Accepted on: Mar 26, 2026
Published on: Apr 29, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Robert Forkel, Johann-Mattis List, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.