Table 1
Included Publications Utilising the SADiLaR Repository.
| ANONYMISED NAME | REPOSITORY CITATION | AUTHOR CATEGORY | LANGUAGE FOCUS | REUSE TYPE |
|---|---|---|---|---|
| P1 | General mention | External | Multi-language | Dataset creation |
| P2 | General mention | Self-reuse | Afrikaans | Direct research |
| P3 | Long URL | Self-reuse | Multi-language | Direct research |
| P4 | Long URL | Self-reuse | Multi-language | Dataset creation |
| P5 | Handle | Self-reuse | Multi-language | Direct research |
| P6 | Handle | Self-reuse | Multi-language | Dataset creation |
| P7 | Long URL | Self-reuse | Multi-language | Dataset creation |
| P8 | Handle | Self-reuse | Multi-language | Dataset creation |
| P9 | General mention | External | Multi-language | Direct research |
| P10 | General mention | External | Multi-language | Dataset creation |
| P11 | Handle | Self-reuse | Multi-language | Dataset creation |
| P12 | General mention | External | Multi-language | Tool creation |
| P13 | Long URL | Internal (SADiLaR) | Multi-language | Tool assessment |
| P14 | Long URL | Self-reuse | Multi-language | Tool creation |
| P15 | Handle | Self-reuse | Afrikaans | Dataset creation |
| P16 | General mention | Internal (SADiLaR) | Multi-language | Direct research |
| P17 | Handle | Self-reuse | Multi-language | Dataset creation |
| P18 | Long URL | Self-reuse | Sesotho | Tool creation |
| P19 | General mention | Internal (SADiLaR) | Sesotho | Resource assessment |
| P20 | General mention | Internal (SADiLaR) | Multi-language | Tool assessment |
| P21 | Handle | External | Multi-language | Direct research |
| P22 | Handle | Internal (SADiLaR) | Afrikaans | Direct research |
| P23 | Long URL | Self-reuse | Afrikaans | Direct research |

Figure 1
Number of Publications by Author and Reuse Type.

Figure 2
Weekly activity for bot and human users.

Figure 3
Activity on the SADiLaR Repository by User Type.

Figure 4
Map of Human Repository Users.
Supplementary file A
Full List of Included Publications Utilising the SADiLaR Repository.
| AUTHORS | YEAR | TITLE |
|---|---|---|
| Adelani, D.I., et al. | 2022 | MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity Recognition |
| Brink, N. | 2020 | A usage-based investigation of Afrikaans-speaking children’s holophrases and communicative intentions |
| De Wet, F., et al. | 2023 | Investigating the Extent and Usability of Webtext Available in South Africa’s Official Languages |
| Du Toit, J. S., & Puttkammer, M. J. | 2021 | Developing Core Technologies for Resource-Scarce Nguni Languages |
| Eiselen, R., & Gaustad, T. | 2023 | Deep learning and low-resource languages: How much data is enough? A case study of three linguistically distinct South African languages |
| Gaustad, T., & McKellar, C. A. | 2024 | Updated Morphologically Annotated Corpora for 9 South African Languages |
| Gaustad, T., & Puttkammer, M. J. | 2022 | Linguistically annotated dataset for four official South African languages with a conjunctive orthography: IsiNdebele, isiXhosa, isiZulu, and Siswati |
| Gaustad, T., et al. | 2025 | Datasets for South African Languages: Bilingual Aligned and Monolingual Data for Machine Translation |
| Kaffee, L.-A., et al. | 2023 | Multilingual Knowledge Graphs and Low-Resource Languages: A Review |
| Marivate, V., et al. | 2025 | Swivuriso: The South African Next Voices Multilingual Speech Dataset |
| McKellar, C. A., & Puttkammer, M. J. | 2020 | Dataset for comparable evaluation of machine translation between 11 South African languages |
| Meyer, F., et al. | 2024 | NGLUEni: Benchmarking and Adapting Pretrained Language Models for Nguni Languages |
| Mlambo, R. & Matfunjwa, M. | 2025 | Human language technology tools for indigenous South African languages and their potential use |
| Puttkammer, M., et al. | 2018 | NLP Web Services for Resource-Scarce Languages |
| Rabé, M. | 2021 | Kodewisseling in Afrikaans-Nederlandse kinders se spraak |
| Setaka, M., & Trollip, B. | 2022 | Resource Repositories and linking resources: An exploratory study |
| Sibeko, J., & Van Zaanen, M. | 2023 | A Data Set of Final Year High School Examination Texts of South African Home and First Additional Language Subjects |
| Sibeko, J., & Van Zaanen, M. | 2025 | Developing and testing syllabification systems for South African Sesotho |
| Sibeko, J., & Setaka, M. | 2023 | An overview of Sesotho BLARK content |
| Skosana, N. J., & Mlambo, R. | 2021 | A brief study of the Autshumato Machine Translation Web Service for South African languages |
| Terblanche, C., et al. | 2025 | The development of synthetic child speech in three South African languages |
| Trollip, B. | 2023 | ’n Gebruiksgebaseerde beskrywing van Afrikaanse prefiksoïede |
| Trollip, B., & Strauss, T. | 2024 | Analysing Afrikaans lexical blends using Levenshtein distances |
