Skip to main content

Extending CLDF — Towards a Type System for Cross-Linguistic Data Cover

Extending CLDF — Towards a Type System for Cross-Linguistic Data

Journal of Open Humanities Data

Volume 12 (2026): Issue 1

By: Robert Forkel and Johann-Mattis List

Open Access

|Apr 2026

Figures & tables

Figures & Tables

Words for “mother” sampled across different languages in Pallas (1789: 10).

Pulmonic consonants of the variety Chevroux in the TPPSR.

The Phlorest project exemplifies the data curation workflow described here: Once a data type is identified, dedicated curation workflows can be supported with tools such as Python packages. Publication of datasets can then be modeled as “release” in GitHub’s standard workflow and the integration with Zenodo makes sure such releases are actually pushed to a longterm archive for scientific content.

Complex data in etymological dictionaries can be constructed from simpler data types which are already standardized in CLDF. In addition to the tabular data known from cognate coded wordlists the reconstruction tree adds direction and order to the reconstructed protoforms.

Articles in this issue

DOI: https://doi.org/10.5334/johd.517 | Journal eISSN: 2059-481X

Journal RSS Feed

Language: English

Page range: 62 - 62

Submitted on: Jan 30, 2026

|

Accepted on: Mar 26, 2026

|

Published on: Apr 29, 2026

Published by: Ubiquity Press

In partnership with: Paradigm Publishing Services

Publication frequency: 1 issue per year

Keywords:

cross-linguistic data,

Cross-Linguistic Data Formats

© 2026 Robert Forkel, Johann-Mattis List, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.

Volume 12 (2026): Issue 1

Previous article