metaScreener: A Plugin-Based Desktop Application for Human-in-the-Loop Systematic Literature Screening

Alejandro Reyes-Consuelo; Jocelyne Kiss; Julien Voisin

doi:10.5334/jors.730

metaScreener: A Plugin-Based Desktop Application for Human-in-the-Loop Systematic Literature Screening

Journal of Open Research Software

Volume 14 (2026): Issue 1

By: Alejandro Reyes-Consuelo , Jocelyne Kiss and Julien Voisin

Open Access

|Jun 2026

Figures & Tables

metaScreener pipeline architecture. Plugins are grouped by functional role; arrows indicate bundle data flow between stages.

Table 1

Plugin inventory for metaScreener version 3. T = inference temperature. Plugin 03 produces a structured criteria_harmonized.csv file consumed by all four downstream filtering plugins.

#	PLUGIN	FUNCTION	METHOD
01	Reference Markers (experimental)	Extracts visually-present reference markers (e.g., `[1]`) from images supplied as PDF or PNG; not designed for standard PRISMA flow diagrams	GPT-4o vision API
02	References-of-X AI	Resolves and enriches bibliographic references via federated API queries	OpenAlex, Crossref, Semantic Scholar
03	Criteria Parser	Converts free-text inclusion/exclusion criteria into a structured, machine-readable criteria file	Rule-based inference, optional LLM refinement
04	EH (Exclusion by Heuristic)	Removes records matching any exclusion criterion at title/abstract level	Deterministic keyword/regex
05	IH (Inclusion by Heuristic)	Retains only records matching at least one inclusion criterion at title/abstract level	Deterministic keyword/regex
06	EL (Exclusion by LLM)	Applies LLM-based eligibility adjudication against exclusion criteria over full record text	OpenAI-compatible endpoint, T=0.0
07	IL (Inclusion by LLM)	Applies LLM-based eligibility adjudication against inclusion criteria over full record text	OpenAI-compatible endpoint, T=0.0

Table 2

Sequential screening funnel for the demonstration use case (initial corpus $N = 776$ ).

STAGE	INPUT	SURVIVORS	EXCLUDED	PRIMARY EXCLUSION REASON
Initial corpus	776	776	—	752 English, 14 French; years 1962–2025
EH (Exclusion by Heuristic)	776	651	125	Conference proceedings ( $n = 112$ ); non-English ( $n = 13$ )
IH (Inclusion by Heuristic)	651	85	566	Publication year < 2018 ( $n = 556$ ); non-English ( $n = 10$ )
EL (Exclusion by LLM)	85	85	0	No records met exclusion criteria
IL (Inclusion by LLM)	85	73	12	Did not meet HMD VR inclusion criterion (IC-4)
Final review corpus	—	73	703	90.6% reduction from initial corpus

Sequential screening funnel for the demonstration use case. Excluded records are shown with exclusion counts at each stage transition.

metaScreener desktop interface, shown on the Criteria Parser plugin (Plugin 03). The left panel accepts free-text inclusion and exclusion criteria; the right panel displays the structured harmonized table, with each row’s pipeline-stage assignment (EH/IH/EL/IL) and matching operator determined by the rule-based inference engine described in Algorithm 1. The log panel at the bottom shows the harmonizer parsing eight criteria and applying optional LLM refinement.

Table 3

Human-versus-LLM agreement on the three LLM-adjudicated criteria from the demonstration corpus. Cohen’s $κ$ is computed between the human aggregate decision and the LLM canonical decision; Fleiss’ $κ$ is computed across the three raters on the 15-record overlap subset per stage. $P_{obs}$ is the percent observed agreement (human vs. LLM) over the same N.

STAGE	CRITERION	COHEN’S $κ$	FLEISS’ $κ$	$P_{obs}$	N
EL	EC-2 (spatial-navigation focus)	–0.05	–0.13	83.5%	85
EL	EC-3 (rubber-hand-illusion focus)	0.10	–0.05	87.1%	85
IL	IC-1 (HMD VR/virtual simulation)	0.28	0.26	56.0%	84

References

Authors

Metrics

Articles in this issue

DOI: https://doi.org/10.5334/jors.730 | Journal eISSN: 2049-9647

Journal RSS Feed

Language: English

Page range: 45 - 45

Submitted on: Mar 31, 2026

Accepted on: May 20, 2026

Published on: Jun 5, 2026

Published by: Ubiquity Press

In partnership with: Paradigm Publishing Services

Publication frequency: 1 issue per year

Keywords:

systematic review,

literature screening,

screening automation,

human-in-the-loop AI,

large language models,

© 2026 Alejandro Reyes-Consuelo, Jocelyne Kiss, Julien Voisin, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.

Volume 14 (2026): Issue 1

metaScreener: A Plugin-Based Desktop Application for Human-in-the-Loop Systematic Literature Screening

Figures & Tables

Figure 1

Table 1

Algorithm 1

Algorithm 2

Algorithm 3

Table 2

Figure 2

Figure 3

Table 3

Paradigm

My account