Abstract
Technical benchmarking standardizes evaluation for comparing model performance, but it often overlooks the interpretive processes of meaning-making that characterize exploratory research in the digital humanities. Digital humanists employ computational strategies to explore corpus structure and uncover thematic patterns through an iterative research design in which different methodologies are tested in pursuit of meaning-making. We conceptualize this form of research as exploratory computation and argue that its research design process can be clarified and communicated through transparent evaluation. This discussion article proposes a framework for qualitatively evaluating the effectiveness of a computational methodology for exploratory tasks by structuring self-reflection on positionality, task evaluation, and retrospection. Through a case study applying the framework to exploratory data visualization, we demonstrate its conceptual and practical utility for early-stage collaborative evaluation of research design. By making processes of humanistic interpretation legible, this article advocates for centering context and pedagogical transparency in computational research design and benchmarking.
