Skip to main content
Have a personal or library account? Click to login
A Database of Person Marking in South-Central Trans-Himalayan Cover

A Database of Person Marking in South-Central Trans-Himalayan

Open Access
|Apr 2026

Figures & Tables

(1)a.ka-se
  1sg-go
  ‘I am going’
 b.se-mak-ung
  go-neg-1sg
  ‘I am not going’Ranglong
(2)a.kaː-tà-tì-nʉ́
  1:p-touch-2-nfut
  ‘you (SG) touch me’Anal Naga
 b.m̩̀-m̥ú-náː-tʃɘ̀
  inv-see-ipfv:tr-2
  ‘you (SG) saw me’Monsang
(3)a.a-t-déé 
  2-inv-see 
  ‘you (SG) see me’ 
 b.m-t-déé
  1-inv-see
  ‘you (SG) see me’Lamkang
Table 1

Languages included in the first release of PMST, with identifiers, group affiliation, collaborators and sources. Languages are ordered by group. Within Northwestern, languages are ordered by how closely related they are assumed to be following the impressionistic subgroupings in Konnerth (2022).

LANGUAGEGLOTTOCODEISOCODE1GROUPCOLLABORATORS & SOURCES
Ranglongrang1271(rnl)NorthwesternHunter Brown, Jessi Tara
Chiruchir1283cdfNorthwesternMechek Sampar Awan; Awan (2019)
Anal Nagaanal1239anmNorthwesternPavel Ozerov; Thotson Langhu; Ozerov (2019)
Monsangmons1234nmhNorthwesternLinda Konnerth, Koninglee Wanglar
Lamkanglamk1238lmkNorthwesternShobhana Chelliah, Rex Rengpu Khullar; Chelliah et al. (2019)
Hmarhmar1241hmrNorthwesternMarina Infimate
Pangkhuapank1249pkhCentralMohammed Zahid Akter; Akter (2024)
Hyowkhya1239(csh)SouthernMuhammad Zakaria; Zakaria (in press)
Figure 1

Location and group affiliation of the sample languages. The inset shows the location of the detailed map within South(east) Asia.

Table 2

Overview of tags used to annotate variation.

TAGCATEGORY OF TAGDESCRIPTION
defaultparadigm_tagunmarked form (most general, most frequent, etc.) or form that has no other tag
pragm_markedparadigm_tagpragmatically conditioned variant
hortparadigm_tagform is a hortative
emphparadigm_tagform is from an emphatic paradigm
unspec_varparadigm_tagvariant of (yet) unspecified distribution
generic_nftense_taggeneric non-future form
non_generic_nftense_tagnon-generic non-future form
pasttense_tagpast tense form
optional_pluraloverabundance_tagform that does not contain a marker for plural
optional_thirdoverabundance_tagform that does not contain a marker for third person
optional_futureoverabundance_tagform that does not contain a marker for future tense
variable_orderorder_tagform contains morphemes that can variably order
special_stemmorphanalysis_tagform has a special stem form in particular cells of a paradigm
tone_alt_stemmorphanalysis_tagform exhibits a tone alternation triggered by the stem
morphophonmorphanalysis_tagform exhibits morphophonological process(es)
copy_vphonanalysis_tagform has a copy vowel in at least one morpheme
dialect_varvariants_tagform from other dialect
sociolect_varvariants_tagform from other sociolect
Figure 2

Schematic overview of workflow and the connection between the working versions of the datasets on GitHub and the published versions on Zenodo. A, B, C represent individual languages.

Table 3

Dataset description.

Repository nameZenodo
Object namePMST-Database
Repository locationAll PMST datasets can be found at https://zenodo.org/communities/pmst/. For DOIs of individual datasets, please consult Table 4.
Format namescsv, json, md, yml
Creation dates2023-12-27 to 2025-12-10
Publication dateThe datasets pertaining to the first release of PMST were published between 2025-12-01 to 2025-12-10.
LicenseCC-BY-SA 4.0
Table 4

Languages (=datasets) included in the first release of PMST, with the number of forms, the number of scenarios,6 and their DOI.

LANGUAGEFORMSSCENARIOSZENODODOI
Anal Naga31118410.5281/zenodo.17881855
Chiru67416510.5281/zenodo.17779437
Hmar26716310.5281/zenodo.17779055
Hyow91735210.5281/zenodo.17788529
Lamkang29815810.5281/zenodo.17780049
Monsang33616310.5281/zenodo.17865713
Pangkhua14914510.5281/zenodo.17866617
Ranglong25514210.5281/zenodo.17778036
Figure 3

Overview of database modules and their relations. Two-way arrows indicate direct links between files, e.g., the forms file can be joined with the cells file via the cell/cell identifier which appears in both files. One-way arrows indicate subset relations, e.g., each phoneme in the phon_form columns appears in the sound file separately.

Figure 4

Distributional profile of morphs in the Ranglong dataset. The top panel shows the distribution across tense-aspect and polarity values. The middle panel shows the distribution across person configurations. The bottom panel shows the distribution across number categories. Stripes are used for elements appearing before the verb stem and circles for those appearing after.

Figure 5

Length of verb forms (minus the lexical stem) in phonemes of transitive affirmative scenarios aggregated per scenario and language. The dot indicates the average; the whiskers show the range. Languages are arranged by subgroup and relatedness (cf. Table 1).

1first person
2second person
3third person
AA (actor) argument of a transitive predicate
INVinverse
IPFVimperfective
NEGnegation
NFUTnon-future
PP (undergoer) argument of a transitive predicate
Ssole argument of an intransitive predicate
SAPspeech-act participant (first and second person)
SCSouth-Central (branch of Trans-Himalayan)
SGsingular
TRtransitive
DOI: https://doi.org/10.5334/johd.505 | Journal eISSN: 2059-481X
Language: English
Submitted on: Dec 19, 2025
Accepted on: Feb 16, 2026
Published on: Apr 7, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Sandra Auderset, Hunter L. Brown, Jonathan Reich, Pascal Gerber, Muhammad Zakaria, Linda Konnerth, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.