Have a personal or library account? Click to login
Uncovering Biases Through Faceted Image Retrieval: Wikidata as a Data Provider for Art-Historical Research Cover

Uncovering Biases Through Faceted Image Retrieval: Wikidata as a Data Provider for Art-Historical Research

Open Access
|Jan 2026

Figures & Tables

Table 1

Results of the data acquisition process, showing the properties retrieved from Wikidata and their absolute frequencies.

IDENTIFIERNAMEDESCRIPTIONFREQUENCY
P18imageimage of relevant illustration of the subject […]36 179
P1476titlepublished name of a work, such as a newspaper article […]29 077
P170creatormaker of this creative work or other object […]33 616
P571inceptiontime when an entity begins to exist […]35 588
P31instance oftype to which this subject corresponds/belongs […]37 393
P2048heightvertical length of an entity35 276
P2049widthwidth of an object35 268
P186made from materialmaterial the subject or the object is made of or derived from […]66 651
P135movementliterary, artistic, scientific or philosophical movement or scene […]2668
P136genrecreative work’s genre or an artist’s field of work […]33 741
P180depictsentity visually depicted in an image, literarily described in a work […]62 848
P921main subjectprimary topic of a work or act of communication13 253
P195collectionart, museum, archival, […] of which the subject is part […]56 189
P276locationlocation of the object, structure or event […]41 557
P17countrysovereign state that this item is in […]5712
P495country of origincountry of origin of this item1210
johd-12-436-g1.png
Figure 1

Front-end design of iART. (a) On the left side of the GUI, users can access facets organized in expandable panels, while the right side displays the search results in an image grid. (b) For nuanced control of the search results, a modal dialog provides advanced configuration options.

Table 2

Mean results for concept extraction, tuple extraction, and relationship extraction in terms of precision (P), recall (R), and F1-score across different LLMs and prompting strategies. The averages are calculated over three iterations. The best-performing approach is indicated in bold.

LLMPROMPTCONCEPT EXTRACTIONTUPLE EXTRACTIONRELATION EXTRACTION
PRF1PRF1PRF1
Micro
Qwen3CoT0.8080.8590.8330.7210.7250.7230.4680.4710.469
Role-CoT0.5600.7060.6240.3840.5030.4350.2060.2710.234
ReAct0.6980.6710.6840.5390.5080.5230.3570.3430.350
Gemma 3CoT0.7760.8570.8150.6390.6650.6520.4220.4480.435
Role-CoT0.5710.7820.6600.3790.4970.4300.1500.2370.184
ReAct0.7570.8160.7850.6090.6200.6140.3550.3650.360
LLaMA 3.1CoT0.1860.0740.1060.2150.1090.1450.1670.0850.112
Role-CoT0.5030.3620.4210.4440.2940.3540.2600.1790.212
ReAct0.3930.2050.2690.3910.2170.2790.2650.1470.189
Macro
Qwen3CoT0.7890.8570.8020.7100.6870.6840.5240.5180.511
Role-CoT0.6510.7820.6620.5190.5370.4900.2940.3020.285
ReAct0.6080.6210.5900.5810.5470.5430.3860.3900.375
Gemma 3CoT0.7810.8670.7950.6140.6250.5910.4040.4400.404
Role-CoT0.6040.7880.6440.4670.5260.4570.1730.2640.189
ReAct0.7770.8300.7740.6280.5750.5680.3280.3210.309
LLaMA 3.1CoT0.1030.1120.0950.2270.2070.2120.1760.1760.176
Role-CoT0.3250.3570.3260.3820.3270.3400.2230.2240.218
ReAct0.1920.1950.1790.3290.2880.3000.2600.2390.246
Table 3

Results for relation extraction across different LLMs and prompting strategies. For each model-prompt configuration, we report the number of concepts |ℂ|, the total number of predicted tuples |ℙ|, the total number of triplets (|T^|), the number of Iconclass notations with valid triplets (|ℕ|), the number without valid triplets (||), and the mean number of triplets per Iconclass notations, i.e., |T^|/||.

LLMPROMPT|ℂ||ℙ||T^||ℕ||||T^|/||
Qwen3CoT24116216471142.3
Role-CoT3022232267872.9
ReAct20414014358272.5
Gemma 3CoT2561821868322.2
Role-CoT3252312798413.3
ReAct2471761788142.2
LLaMA 3.1CoT2513138771.6
Role-CoT131677235502.1
ReAct57242411742.2
Table 4

Qualitative comparison of ground-truth and predicted triplets (represented in the form subject-predicate-object) for selected Iconclass notations N, with associated descriptions DN shown in parentheses.

NGROUND TRUTHPREDICTION
SUBJECTPREDICATEOBJECTSUBJECTPREDICATEOBJECT
93G (House of Sleep: a gloomy cave through which runs the river Lethe; possibly with two gates, […])
River Letheruns throughHouse of SleepHouse of Sleepruns_throughriver Lethe
ivory gateissuesdeceptive dreamsivory gateissuesdeceptive dreams
horn gateissuestrue dreamshorn gateissuestrue dreams
House of Sleepcontainstwo gatesHouse of Sleephas_gateivory gate
House of Sleephas_gatehorn gate
11G311 (Michael resting beside the dragon’s corpse)
Michaelrests besidedragon’s corpseMichaelresting_besidedragon’s corpse
Dragonrepresented ascorpse
11U4 (Mary and John the Baptist together with (e.g. kneeling before) the judging Christ, ’Deesis’ Last Judgement)
Marykneels beforeChristMarykneels_beforejudging Christ
John the Baptistkneels beforeChristJohn the Baptistkneels_beforejudging Christ
Marytogether withJohn the Baptist
Christrepresented asJudge
29B (Plants behaving as human beings or animals)
plantsbehave likehuman beingsplantsbehaving_ashuman beings
plantsbehave likeanimalsplantsbehaving_asanimals
94P68 ((story of) Perseus – death)
PerseuskillsMedusa
johd-12-436-g2.png
Figure 2

Search results in iART for the queries “Amor” (a) and “Pietà” (b), with the facets “time start” and “location” expanded.

johd-12-436-g3.png
Figure 3

Search results in iART for the queries “wisdom” (a) and “beauty” (b), with the facets “time start” and “depicts” expanded.

johd-12-436-g4.png
Figure 4

Search results in iART for the queries “warrior” (a) and “market” (b), with the facets “time start” and “material” expanded.

johd-12-436-g5.png
Figure 5

Search results in iART for the queries “nude” (a) and “naked” (b), with the facets “time start” and “depicts” expanded.

DOI: https://doi.org/10.5334/johd.436 | Journal eISSN: 2059-481X
Language: English
Submitted on: Oct 27, 2025
|
Accepted on: Dec 16, 2025
|
Published on: Jan 16, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Stefanie Schneider, Matthias Springstein, Julian Stalter, Daniel Ritter, Ralph Ewerth, Eric Müller-Budack, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.