Have a personal or library account? Click to login
Topological Data Analysis and Multiple Kernel Learning for Species Identification of Modern and Archaeological Small Ruminants Cover

Topological Data Analysis and Multiple Kernel Learning for Species Identification of Modern and Archaeological Small Ruminants

Open Access
|May 2025

Figures & Tables

Table 1

Summary of sampled modern and archaeological species.

SPECIESVARIETYCURATORNUMBER
Alpine ibex (Capra ibex)Archaeological from Southern Alps (Liguro-Provençal Bassin)Museum of Prehistoric Anthropology of Monaco – Archeological site of “Grotte de l’Observatoire”29
Modern from AlpsOsteological collection from Thierry Argant (Éveha Lyon, ArAr UMR 5138)1
Goat (Capra sp.)Modern Capra nubiana (zoological specimen)National Museum of Natural History Paris’ Mammalian and Birds collection2
Modern domestic goat from FranceOsteological collections from Archéorient UMR 5133; National Museum of Natural History Paris’ Mammalian and Birds collection5
Modern domestic goat from IranOsteological collections from Bioarchaeology Laboratory, University of Tehran, Iran20
Modern domestic goat from Egypt (zoological specimen)National Museum of Natural History Paris’ Mammalian and Birds collection2
Modern feral goat from Crete (Capra aegagrus cretica)Osteological collections from Archéorient UMR 51331
Roe deer (Capreolus capreolus)Modern from FranceNational Museum of Natural History Paris’ Mammalian and Birds collection30
Gazelle (Gazella sp.)Modern Gazella cuvieriNational Museum of Natural History Paris’ Mammalian and Birds collection2
Modern Gazella dorcasNational Museum of Natural History Paris’ Mammalian and Birds collection5
Modern Gazella spekeiNational Museum of Natural History Paris’ Mammalian and Birds collection1
Modern Gazella sp.National Museum of Natural History Paris’ Mammalian and Birds collection; Osteological collections from Archéorient UMR 51337
Archaeological from Syria (Gazella cf. subgutturosa?)Daniel Helmer – Emmanuelle Vila (UMR 5133 Archéorient)15
Sheep (Ovis aries)Modern sheep from FranceOsteological collections from CEPAM UMR 7264, AASPE UMR 7209 & Archéorient UMR 51337
Modern sheep from EthiopiaILRI – Agraw Amane – Emmanuelle Vila – EvoSheep projet (ANR ANR-17-CE27-0004)23
Total of 3D astragali150
jcaa-8-1-181-g1.png
Figure 1

3D astragalus presented in dorsal view of (a) Alpine ibex (Capra ibex), (b) sheep (Ovis aries), (c) goat (Capra hircus), (d) roe deer (Capreolus capreolus) and (e) gazelle (Gazella sp.). Alpine ibex astragalus is untextured.

jcaa-8-1-181-g2.png
Figure 2

Evolution of the simplicial complex of a 3D astragalus (reference name Obs_1997_187) during the Alpha filtration process, highlighting the impact of varying radius thresholds r. (Left) At this stage, the bone’s structure is partially reconstructed, with multiple connected components and some topological cycles, 1-dimensional features, visible. (Center) The Alpha complex has merged into a single connected component, capturing the overall structure of the bone. The cycles from the previous step have been filled (died), and the complex closely approximates the 3D shape of the bone. (Right) At this advanced stage, most topological cycles have disappeared, and the complex over-reconstructs the bone’s shape. The filtration process concludes once all topological voids, 2-dimensional features, are filled.

jcaa-8-1-181-g3.png
Figure 3

Wasserstein kernel matrices without bone’s normalisation. Up: topological dimension 1; Down: topological dimension 2. For both the matrices the color code, indicated by the colour bar on the right of the matrix, represents pairwise similarity within the range [0, 1]. Yellow cells (similarity equals to 1), such as those along the diagonal, signify that the x and y bones are identical. As the color shifts towards blue, the bones exhibit increasing dissimilarity (similarity approaching 0).

jcaa-8-1-181-g4.png
Figure 4

Illustration of topological dissimilarities observed in sheep in the Wasserstein kernel matrices without bone’s normalisation (dimension 1). Picture of sheep breed “Bonga” (a) and “Menz” (b) from Ethiopia © A. Amane / E. Vila. c) Picture of sheep breed “Landes de Bretagne” from France ©H. Ronné https://www.ecomusee-rennes-metropole.fr/le-mouton-des-landes/.

jcaa-8-1-181-g5.png
Figure 5

Wasserstein kernel matrices with bone’s normalisation. Up: topological dimension 1; Down: topological dimension 2. For both the matrices the color code, indicated by the colour bar on the right of the matrix, represents pairwise similarity within the range [0, 1]. Yellow cells (similarity equals to 1), such as those along the diagonal, signify that the x and y bones are identical. As the color shifts towards blue, the bones exhibit increasing dissimilarity (similarity approaching 0).

Table 2

Average test classification accuracy.

ACC. std.dv.ALPINE IBEXGOATROE DEERGAZELLESHEEP
Multiple KLR0.8110.0640.9730.0610.6230.2180.9270.1010.8970.1240.6370.228
jcaa-8-1-181-g6.png
Figure 6a

Boxplot average test classification accuracies. The first column (leftmost) represents the average performance over the five classes, while the others show the average accuracy for each specie. The central line (orange) within each box represents the median accuracy, while the lower and upper edges of the box correspond to the first (Q1) and third quartiles (Q3), respectively, indicating the interquartile range (IQR). The whiskers extend to the minimum and maximum values within (1.5*IQR) from Q1 and Q3. Beyond this range are considered outliers and are shown as individual markers.

Table 3

Kernel weight’s estimates.

KERNEL WEIGHTS (POSTERIOR MEAN)
β1β2β3β4
IW-SWI0.2310.0070.2330.0060.2750.0100.4810.020
jcaa-8-1-181-g7.png
Figure 6b

Kernel densities estimates. Different colors correspond to the weights of the four Wasserstein kernel matrices obtained via TDA. The X axis corresponds to weights values, while the y axis indicates the estimated density, reflecting the relative frequency of occurrence. Peaks in the density curves show the most probable values of the weights, while the spread of each distribution provides insight into the variability and uncertainty of the estimates.

jcaa-8-1-181-g8.png
Figure 7

Learned kernel matrix for a single data split. It represents the optimal linear combination, where each input kernel matrix is weighted by its corresponding estimated weight value. The color code follows the same interpretation as in Figures 3 and 5.

DOI: https://doi.org/10.5334/jcaa.181 | Journal eISSN: 2514-8362
Language: English
Submitted on: Sep 13, 2024
Accepted on: Apr 24, 2025
Published on: May 23, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Manon Vuillien, Davide Adamo, Emmanuelle Vila, Amane Agraw, Thierry Argant, Daniel Helmer, Marjan Mashkour, Abdelkader Moussous, Olivier Notter, Elena Rossoni-Notter, Isabelle Théry, Marco Corneli, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.