
Building Responsible and Sustainable Open Data Literacy Skills for Early Career Researchers: A Decade of the SoRDS Programme
References
- Alzate-Cardona, J.D., Sabogal-Suárez, D., Arbeláez-Echeverri, O.D. and Restrepo-Parra, E. (2018) ‘VEGAS: Software package for the atomistic simulation of magnetic materials’, Revista Mexicana de Física, 64(5), pp. 490–497. Available at: 10.31349/RevMexFis.64.490
- Basalti, C., Fazekas Paragh, J., Forni, M., van Gelder, C., Hasani Mavriqi, I., Janik, J., Kalová, T., Kuchma, I., Lindroos, H., Lütcke, H., Pinnick, J., Raga, N., Thorpe, D. and Wildgaard, L. (2024) Recommendations for Data Stewardship Skills, Training and Curricula with Implementation Examples from European Countries and Universities (Version v1) [Report]. EOSC Task Force on Data Stewardship Curricula and Career Paths. Zenodo. Available at: 10.5281/zenodo.10573892
- Bezuidenhout, L., Drummond Curtis, S., Walker, B., Shanahan, H. and Alfaro Córdoba, M. (2021) ‘A school and a network: CODATA RDA Data Science Summer Schools alumni survey’, Data Science Journal, 20(10). Available at: 10.5334/dsj-2021-010
- Bezuidenhout, L., Karrar, O., Lezaun, J. and Nobes, A. (2019) ‘Economic sanctions and academia: Overlooked impact and long-term consequences’, PLOS One, 14(10), p. e0222669. Available at: 10.1371/journal.pone.0222669
- Bezuidenhout, L., Quick, R. and Shanahan, H. (2020) ‘“Ethics when you least expect it”: A modular approach to short course data ethics instruction’, Science and Engineering Ethics, 26(4), pp. 2189–2213. Available at: 10.1007/s11948-020-00197-2
- Biernacka, K., Bierwirth, M., Buchholz, P., Dolzycka, D., Helbig, K., Neumann, J., Odebrecht, C., Wiljes, C. and Wuttke, U. (2020) Train the trainer concept on research data management (Version 3.0). Zenodo. Available at: 10.5281/zenodo.4071471
- Biernacka, K., Helbig, K. and Buchholz, P. (2021) ‘Adaptable methods for training in research data management’, Data Science Journal, 20(1), p. 14. Available at: 10.5334/dsj-2021-014
- Bode, J., Jaeger, P. and Schneidewind, S. (2023a) ‘Datenkompetenz im Physikstudium — ein Erfahrungsbericht’, arXiv. Available at: 10.48550/arxiv.2301.03455
- Bode, J., Jaeger, P. and Schneidewind, S. (2023b) ‘Integrating data literacy into university curricula student centred learning in undergraduate physics lab courses’, Proceedings of the Conference on Research Data Infrastructure, 1. Available at: 10.52825/CoRDI.v1i.349
- Cobe, R. (Ed.), Shanahan, H., Bezuidenhout, L., Quick, R., Peterson, B., Okorafor, E., Alfaro Córdoba, M., El Jadid, S., Venkataraman, S., Van den Eynden, V. and Gandhi, S. (2023) CODATA-RDA Schools of Research Data Science (October 1, 2020 – November 30, 2023; Version v1) [Newsletter]. Zenodo. Available at: 10.5281/zenodo.16875357
- CODATA. (2025) Research data science summer schools. Available at: https://codata.org/initiatives/data-skills/research-data-science-summer-schools/ (Accessed: 14 February 2025).
- CODATA, and Research Data Alliance. (2024) Enabling global FAIR data: WorldFAIR policy recommendations for research infrastructures [Policy brief].
- CODATA-RDA-DataScienceSchools. (2025) Materials for schools of research data science. GitHub. Available at: https://github.com/CODATA-RDA-DataScienceSchools/Materials
- Davenport, T.H. and Redman, T.C. (2022) ‘How AI is improving data management’, MIT Sloan Management Review, 63(4), pp. 101–105. Available at: https://sloanreview.mit.edu/article/how-ai-is-improving-data-management/
- Demchenko, Y. and Stoy, H. (2021) Research data management and data stewardship competences in university curriculum. IEEE Global Engineering Education Conference (EDUCON). Available at: https://www.uazone.org/demch/papers/educon2021-data-stewardship-competence-fw-v02.pdf
- Diggs, S. (2025) UC3 New Year Series: Data publishing at CDL in 2025. UC3 – California Digital Library. Available at: https://uc3.cdlib.org/2025/03/05/uc3-new-year-series-data-publishing-at-cdl-in-2025/
- Doehle, P., Bjornen, K. and Chartier, M. (2019) Promoting data literacy across campus with Carpentries: The experience of three librarians [PowerPoint slides]. Edmon Low Library, Oklahoma State University. Available at: https://www.okacrl.org/wp-content/uploads/002-112_OSU_Carpentries.pptx
- Gandhi, S.R. and Anyiam, F.E. (2022) ‘Urban data science education: A key actor towards improving data-driven policy-making for solving urban problems’, Journal of Education, Society and Behavioural Science, 35(5), pp. 1–14. Available at: 10.9734/jesbs/2022/v35i530421
- Goben, A. and Griffin, T.M. (2019) ‘In aggregate: Trends, needs, and opportunities from research data management surveys’, College & Research Libraries, 80(5), pp. 643–663. Available at: 10.5860/crl.80.7.903
- He, D. and Wang, L. (2023) ‘Job analyses of earth science data managers: A survey validation of competencies to inform curricula in research data management education’, Journal of Education for Library and Information Science, 64(2), pp. 104–119. Available at: 10.3138/jelis-2021-0023
- International Centre for Theoretical Physics. (2025a) The CODATA-RDA School for Research Data Science (smr 4092). ICTP. Available at: https://indico.ictp.it/event/10857/
- International Centre for Theoretical Physics. (2025b) The CODATA-RDA Advanced Workshops for Research Data Science (smr 4168). ICTP. Available at: https://indico.ictp.it/event/10990/
- Jordan, K.L. (2018) Evidence of Carpentries’ impact on learners. The Carpentries. Available at: https://carpentries.org/blog/2018/07/evidence-impact/
- Jordan, K.L., Michonneau, F. and Weaver, B. (2018) Analysis of Software and Data Carpentry’s pre and post workshop surveys (Version 1) [Assessment report]. The Carpentries. Available at: 10.5281/zenodo.1325464
- Kanza, S. and Knight, N. (2021) Failed it to nailed it: Responsible data management: Legal & ethical aspects (AI3SD-Event-Series:Report-21). University of Southampton. Available at: 10.5258/SOTON/P0034
- Kanza, S. and Knight, N. (2022) ‘Behind every great research project is great data management’, BMC Research Notes, 15, p. 20. Available at: 10.1186/s13104-022-05908-5
- Kotsis, S.V. and Chung, K.C. (2013) ‘Application of the “see one, do one, teach one” concept in surgical training’, Plastic and Reconstructive Surgery, 131(5), pp. 1194–1201. Available at: 10.1097/PRS.0b013e318287a0b3
- Krahe, M.A., Toohey, J., Wolski, M., Scuffham, P.A. and Reilly, S. (2020) ‘Research data management in practice: Results from a cross-sectional survey of health and medical researchers’, Health Information Management Journal, 49(2–3), pp. 108–116. Available at: 10.1177/1833358319831318
- LIBER Research Data Management (RDM) Working Group. (2020) ‘The 6 pillars of engaging researchers in research data management (RDM)’, LIBER (Ligue des Bibliothèques Européennes de Recherche – Association of European Research Libraries). Available at: https://libereurope.eu/wp-content/uploads/2020/12/The-6-Pillars-of-Engaging-Researchers-in-Research-Data-Management-RDM.pdf
- Maienschein, J., MacCord, K. and Elliott, S. (2019) ‘Help with data management for the novice and experienced alike’, in G. Ramsey and A. De Block (eds.) The dynamics of science. Pittsburgh: University of Pittsburgh Press, pp. 123–140.
- Majid, S., Foo, S. and Zhang, X. (2018) ‘Research data management by academics and researchers: Perceptions, knowledge and practices’, in L. Chen, Y. Liu and T.M.K.K.P. Ma (eds.) Digital libraries and knowledge organization. ICADL 2018. Lecture Notes in Computer Science, Vol 11282. Springer, pp. 160–175. Available at: 10.1007/978-3-030-04257-8_16
- Mumuni, A.G. and Mumuni, F. (2024) ‘Automated data processing and feature engineering for deep learning and big data applications: A survey’, Journal of Information and Intelligence, 3(2), pp. 113–153. Available at: 10.1016/j.jiixd.2024.01.002
- Oo, C.Z., Chew, A.W., Wong, A.L.H., Gladding, J. and Stenstrom, C. (2021) ‘Delineating the successful features of research data management training: A systematic review’, International Journal for Academic Development, 27(3), pp. 249–264. Available at: 10.1080/1360144X.2021.1898399
- Quick, R. (2016) Computational Infrastructures at CODATA RDA Summer School in Research Data Science Aug 1–12 2016 (Version v1) [Lesson]. Zenodo. Available at: 10.5281/zenodo.154430
- Quick, R., Córdoba, M.A., Cobe, R., Peterson, B., Shanahan, H., Costantini, A., EL-Sara, EL Jadid, S., sv1uk, abellew and Bezuidenhout, L. (2023) CODATA-RDA-DataScienceSchools/Materials: Treiest2023 (Version v2023) [Software]. Zenodo. Available at: 10.5281/zenodo.8350033
- Quick, R., Córdoba, M.A., Diggs, S., Cobe, R., Bezuidenhout, L., Shanahan, H. and Peterson, B. (2023) ‘Foundational data science training for health equity researchers at minority serving institutions: A SoRDS event’, 2023 IEEE 11th International Conference on Healthcare Informatics (ICHI). Houston, TX, USA, 26–29 June 2023. IEEE, pp. 663–667. Available at: 10.1109/ICHI57859.2023.00115
- Rantasaari, J. (2022) ‘Multi-stakeholder research data management training as a tool to improve the quality, integrity, reliability and reproducibility of research’, LIBER Quarterly, 32(1), pp. 1–54. Available at: 10.53377/lq.11726
- Read, K., Larson, C., Gillespie, C., Oh, S.Y. and Surkis, A. (2019) ‘A two-tiered curriculum to improve data management practices for researchers’, PLOS One, 14(5), p. e0215509. Available at: 10.1371/JOURNAL.PONE.0215509
- Schmidt, B. and Shearer, K. (2017) The WHAT and the HOW of research data management: Towards a unified view of train-the-trainer competencies. Digital Curation Centre. Available at: https://www.dcc.ac.uk/sites/default/files/documents/IDCC17~/80_How_Why_RDM.pdf
- Shanahan, H., Harrison, A. and May, S.T. (2015) ‘Teaching data science and cloud computing in low and middle income countries’, Advanced Techniques in Biology & Medicine, 3(3), p. 150. Available at: 10.4172/2379-1764.1000150
- Shanahan, H., Hoebelheinrich, N. and Whyte, A. (2021) ‘Progress toward a comprehensive teaching approach to the FAIR data principles’, Patterns (N Y), 2(10), p. 100324. Available at: 10.1016/j.patter.2021.100324
- Tachie, C.Y.E., Obiri-Ananey, D., Alfaro-Cordoba, M., Tawiah, N.A. and Aryee, A.N.A. (2024) ‘Classification of oils and margarines by FTIR spectroscopy in tandem with machine learning’, Food Chemistry, 431, p. 137077. Available at: 10.1016/j.foodchem.2023.137077
- Tamm, H.C. and Nikiforova, A. (2025) From Data Quality for AI to AI for Data Quality: A Systematic Review of Tools for AI-Augmented Data Quality Management in Data Warehouses. Available at: https://arxiv.org/abs/2406.10940
- University College Cork. (n.d.) DH5001 – Digital Humanities Programme Overview. Available at: https://www.ucc.ie/en/dh5001/
- University of Vienna. (2022) “Data Steward” certificate programme. Research Data Management. University of Vienna. Available at: https://rdm.univie.ac.at/data-stewards-at-the-university/become-a-data-steward/
- Wiley, C.A. and Kerby, E.E. (2018) ‘Managing research data: Graduate student and postdoctoral researcher perspectives’, Issues in Science and Technology Librarianship, 89, pp. 1–15. Available at: 10.29173/istl1725
- Wilkinson, M.D., Dumontier, M., Aalbersberg, I.J., Appleton, G., Axton, M., Baak, A., Blomberg, N., Boiten, J-W., da Silva Santos, L.B., Bourne, P.E., Bouwman, J., Brookes, A.J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C.T., Finkers, R., Gonzalez-Beltran, A., Gray, A.J.G., Groth, P., Goble, C., Grethe, J.S., Heringa, J., ‘t Hoen, P.A.C., Hooft, R., Kuhn, T., Kok, R., Kok, J., Lusher, S.J., Martone, M.E., Mons, A., Packer, A.L., Persson, B., Rocca-Serra, P., Roos, M., van Schaik, R., Sansone, S.-A., Schultes, E., Sengstag, T., Slater, T., Strawn, G., Swertz, M.A., Thompson, M., van der Lei, J., van Mulligen, E., Velterop, J., Waagmeester, A., Wittenburg, P., Wolstencroft, K., Zhao, J. and Mons, B. (2016) ‘The FAIR guiding principles for scientific data management and stewardship’, Scientific Data, 3, p. 160018. Available at: 10.1038/sdata.2016.18
- Yang, W., Fu, R., Amin, M.B. and Kang, B.H. (2025) ‘The impact of modern AI in metadata management’, Human-Centric Intelligent Systems, 5, pp. 323–350. Available at: 10.1007/s44230-025-00106-5
- Yu, F., Deuble, R. and Morgan, H. (2017) ‘Designing research data management services based on the research lifecycle – a consultative leadership approach’, Journal of the Australian Library and Information Association, 66(3), pp. 287–298. Available at: 10.1080/24750158.2017.1364835
DOI: https://doi.org/10.5334/dsj-2026-012 | Journal eISSN: 1683-1470
Language: English
Page range: 12 - 12
Submitted on: Aug 15, 2025
Accepted on: Mar 3, 2026
Published on: Mar 19, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year
© 2026 Shaily Gandhi, Steve Diggs, Marcela Alfaro Córdoba, Louise Bezuidenhout, Raphael Cobe, Sara El Jadid, Bianca Peterson, Robert Quick, Hugh Shanahan, Shanmugasundaram Venkataraman, Ekpe Okorafor, Veerle Van den Eynden, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.