Have a personal or library account? Click to login
PapyGreek Treebanks: A Dataset of Linguistically Annotated Greek Documentary Papyri Cover

PapyGreek Treebanks: A Dataset of Linguistically Annotated Greek Documentary Papyri

Open Access
|Nov 2021

Abstract

The PapyGreek Treebanks dataset contains documentary texts written in Postclassical Greek (ca. 300 BCE–700 CE), morphosyntactically annotated according to Dependency Grammar. The source of the texts is the Duke Databank of Documentary Papyri (DDbDP), which preserves the modern editorial treatment of the documents in TEI Epidoc XML encoding. Aiming to expose linguistic variation in the DDbDP, we have annotated two versions of a selection of documents: the plain transcription and an editorially corrected version. The dataset also comprises metadata about the documents’ dating and provenance, text type, and the persons involved. Furthermore, it facilitates linguistic research on these texts.

DOI: https://doi.org/10.5334/johd.55 | Journal eISSN: 2059-481X
Language: English
Published on: Nov 5, 2021
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2021 Marja Vierros, Erik Henriksson, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.