Table 1
Number of doculects per number of concepts expressed in absolute and relative terms. Note that the number of entries for a doculect will be higher than the number of concepts in the case of synonyms.
| NUMBER OF CONCEPTS | DOCULECTS | PERCENTAGE OF DOCULECTS |
|---|---|---|
| 30 | 330 | 5.0 |
| 31 | 306 | 4.7 |
| 32 | 361 | 5.5 |
| 33 | 401 | 6.1 |
| 34 | 595 | 9.1 |
| 35 | 627 | 9.5 |
| 36 | 786 | 12.0 |
| 37 | 605 | 9.2 |
| 38 | 627 | 9.5 |
| 39 | 736 | 11.2 |
| 40 | 1198 | 18.2 |

Figure 1
Location of the doculects included in the dataset, using information from Hammarström et al. (2022); colours are automatically assigned to differentiate language families.
Table 2
Absolute and relative doculect coverage per concept, along with the Concepticon mapping for each concept.
| CONCEPT GLOSS | DOCULECTS (RATIO) | CONCEPTICON NAME/ID |
|---|---|---|
| 1pl | 5265 (0.801) | WE/1212 |
| 1sg | 5379 (0.818) | I/1209 |
| 2sg | 5231 (0.795) | THOU/1215 |
| blood | 6426 (0.977) | BLOOD/946 |
| bone | 6351 (0.966) | BONE/1394 |
| breast | 5957 (0.906) | BREAST/1402 |
| come | 6130 (0.932) | COME/1446 |
| die | 6125 (0.931) | DIE/1494 |
| dog | 6430 (0.978) | DOG/2009 |
| drink | 6058 (0.921) | DRINK/1401 |
| ear | 6475 (0.985) | EAR/1247 |
| eye | 6494 (0.988) | EYE/1248 |
| fire | 6417 (0.976) | FIRE/221 |
| fish | 6226 (0.947) | FISH/227 |
| full | 4190 (0.637) | FULL/1429 |
| hand | 5693 (0.866) | HAND/1277 |
| hear | 5898 (0.897) | HEAR/1408 |
| horn | 4317 (0.656) | HORN (ANATOMY)/1393 |
| knee | 5357 (0.815) | KNEE/1371 |
| leaf | 6077 (0.924) | LEAF/628 |
| liver | 5454 (0.829) | LIVER/1224 |
| louse | 5711 (0.868) | LOUSE/1392 |
| mountain | 5321 (0.809) | MOUNTAIN/639 |
| name | 6042 (0.919) | NAME/1405 |
| new | 5711 (0.868) | NEW/1231 |
| night | 6289 (0.956) | NIGHT/1233 |
| nose | 6404 (0.974) | NOSE/1221 |
| one | 6296 (0.958) | ONE/1493 |
| path | 6151 (0.935) | PATH/2252 |
| person | 5552 (0.844) | PERSON/683 |
| see | 6104 (0.928) | SEE/1409 |
| skin | 6182 (0.940) | SKIN/763 |
| star | 6220 (0.946) | STAR/1430 |
| stone | 6290 (0.957) | STONE/857 |
| sun | 5877 (0.894) | SUN/1343 |
| tongue | 6430 (0.978) | TONGUE/1205 |
| tooth | 6399 (0.973) | TOOTH/1380 |
| tree | 5850 (0.890) | TREE/906 |
| two | 6285 (0.956) | TWO/1498 |
| water | 6413 (0.975) | WATER/948 |
Table 3
A modified snippet from the lexical dataset, showing the most critical columns for a subset of Tupian words for the concept “dog”. The data includes a unique language name, a Glottocode (when available), the family name, a concept gloss derived from the Concepticon catalog, the phonological transcription of the word, the phonological alignment of the word in its cognate set (with hyphens indicating gaps), and a cognate set index.
| LANGUAGE | CODE | FAMILY | CONCEPT | FORM | ALIGNMENT | COGSET |
|---|---|---|---|---|---|---|
| Aché | ache1246 | Tupian | DOG | bɐegi | b ɐ e g i | 16 |
| Amundava | amun1246 | Tupian | DOG | ɲɐɲwɐrɐ | ɲ ɐ ɲ w - ɐ r ɐ | 17 |
| Avá Canoeiro | avac1239 | Tupian | DOG | jɐwɐrɐ | j ɐ - w - ɐ r ɐ | 17 |
| Paraguayan Guaraní | para1311 | Tupian | DOG | dʒɐgwɐ | dʒ ɐ g w - ɐ - - | 17 |
| Kaiwá | kaiw1246 | Tupian | DOG | jɐgwɐ | j ɐ g w - ɐ - - | 17 |
| Eastern Bolivian Guaraní | east2555 | Tupian | DOG | jeimbɐ | j e - i m b ɐ | 19 |
| Tapieté | tapi1253 | Tupian | DOG | ɲɐʔəmbɐ | ɲ ɐ ʔ ə m b ɐ | 19 |
| Cinta Larga | cint1239 | Tupian | DOG | ɐwəli | ɐ w ə l i | 20 |
| Gavião Do Jiparaná | gavi1246 | Tupian | DOG | ɐvələ | ɐ v ə l ə | 20 |

Figure 2
A neighbour-net for the Tupian languages in the dataset, plotted with SplitsTree v4 (Huson & Bryant, 2006).

Figure 3
The “global” language tree from the combined Bayesian MCMC phylogenetic inferences, plotted with iTOL (Letunic & Bork, 2021).
Table 4
Distance between Swedish (swed1254) and other languages, as computed using the Neighbour Joining trees (NJ, from zero to infinite), the Bayesian trees (B, from zero to 4.0), and the normalized Bayesian trees (NB, from zero to 1.0).
| LANGUAGE (GLOTTOCODE) | NJ | B | NB |
|---|---|---|---|
| Norwegian Bokmål (norw1259) | 0.21 | 0.11 | 0.02 |
| Danish (dani1285) | 0.24 | 0.02 | 0.01 |
| Dutch (dutc1256) | 0.41 | 1.40 | 0.35 |
| English (stan1293) | 0.42 | 1.40 | 0.35 |
| Italian (ital1282) | 0.84 | 1.60 | 0.40 |
| Hindi (hind1269) | 0.90 | 1.95 | 0.48 |
| Hittite (hitt1242) | 0.90 | 1.97 | 0.49 |
| Basque (basq1248) | ∞ | 4.00 | 1.00 |
