Abstract
Systematic reviews are widely used as a rigorous method for synthesizing scientific evidence, yet manually screening literature and extracting data remains a time-consuming bottleneck: researchers often spend weeks sifting through thousands of articles to identify relevant studies. This paper presents ReviewAid (v2.1.0), an open-source, AI-powered software tool designed to automate and accelerate this workflow. Unlike single-model solutions, ReviewAid offers a flexible architecture supporting multiple AI providers, including OpenAI, Anthropic, DeepSeek, Cohere, Z.ai, and local execution via Ollama, allowing researchers to balance cost, speed, and privacy. Built with Python and Streamlit, ReviewAid supports PICO-based screening, a structured method used in healthcare to identify studies by Population, Intervention, Comparison, and Outcome, as well as customizable data extraction. A significant challenge in applying AI to research is that models often produce malformed outputs, such as responses with broken formatting, which can crash standard parsers. To address this, ReviewAid introduces a ‘Bulletproof Parsing Pipeline’ designed to recover data from these imperfect responses. Additionally, it features a hierarchical four-tier confidence scoring system that quantifies the certainty of each AI decision. Validation on over 100 articles demonstrated high processing speeds and robust error handling. ReviewAid is architected as a decision-support tool: not a replacement for human judgment, but a ‘third reference’ layer assisting the review process. It is distributed under the Apache 2.0 license.
