Performance of ChatGPT and GPT-4 on Polish National Specialty Exam (NSE) in Ophthalmology

Marcin Ciekalski; Maciej Laskowski; Agnieszka Koperczak; Maria Śmierciak; Sebastian Sirek

doi:10.2478/ahem-2024-0006

.blurhash-client-img { display: none !important; }

Performance of ChatGPT and GPT-4 on Polish National Specialty Exam (NSE) in Ophthalmology

Postępy Higieny i Medycyny Doświadczalnej

Volume 78 (2024): Issue 1 (January 2024)

By: Marcin Ciekalski , Maciej Laskowski , Agnieszka Koperczak , Maria Śmierciak and Sebastian Sirek

Open Access

|Sep 2024

Abstract

Introduction

Artificial intelligence (AI) has evolved significantly, driven by advancements in computing power and big data. Technologies like machine learning and deep learning have led to sophisticated models such as GPT-3.5 and GPT-4. This study assesses the performance of these AI models on the Polish National Specialty Exam in ophthalmology, exploring their potential to support research, education, and clinical decision-making in healthcare.

Materials and Methods

The study analyzed 98 questions from the Spring 2023 Polish NSE in Ophthalmology. Questions were categorized into five groups: Physiology & Diagnostics, Clinical & Case Questions, Treatment & Pharmacology, Surgery, and Pediatrics. GPT-3.5 and GPT-4 were tested for their accuracy in answering these questions, with a confidence rating from 1 to 5 assigned to each response. Statistical analyses, including the Chi-squared test and Mann-Whitney U test, were employed to compare the models’ performance.

Results

GPT-4 demonstrated a significant improvement over GPT-3.5, correctly answering 63.3% of questions compared to GPT-3.5’s 37.8%. GPT-4’s performance met the passing criteria for the NSE. The models showed varying degrees of accuracy across different categories, with a notable gap in fields like surgery and pediatrics.

Conclusions

The study highlights the potential of GPT models in aiding clinical decisions and educational purposes in ophthalmology. However, it also underscores the models’ limitations, particularly in specialized fields like surgery and pediatrics. The findings suggest that while AI models like GPT-3.5 and GPT-4 can significantly assist in the medical field, they require further development and fine-tuning to address specific challenges in various medical domains.

References

Deloitte Malta [Internet]. [cited 2023 Oct 21]. The Age of Artificial Intelligence: A brief history... | Deloitte Malta | RPA & AI. Available from: https://www2.deloitte.com/mt/en/pages/rpa-and-ai/articles/mt-age-of-ai-1-a-brief-history.html
Search in Google Scholar Back to article
Ting DSW, Pasquale LR, Peng L, Campbell JP, Lee AY, Raman R, Tan GSW, Schmetterer L, Keane PA, Wong TY. Artificial intelligence and deep learning in ophthalmology. Br J Ophthalmol. 2019 Feb;103(2):167–75.
Search in Google Scholar Back to article
Li Z, Wang L, Wu X, Jiang J, Qiang W, Xie H, Zhou H, Wu S, Shao Y, Chen W. Artificial intelligence in ophthalmology: The path to the real-world clinic. Cell Rep Med. 2023 Jul 18;4(7):101095.
Search in Google Scholar Back to article
Moshirfar M, Altaf AW, Stoakes IM, Tuttle JJ, Hoopes PC. Artificial intelligence in ophthalmology: a comparative analysis of GPT-3.5, GPT-4, and human expertise in answering StatPearls questions. Cureus. 2023 Jun;15(6):e40822.
Search in Google Scholar Back to article
Cai LZ, Shaheen A, Jin A, Fukui R, Yi JS, Yannuzzi N, Alabiad C. Performance of generative large language models on ophthalmology board-style questions. Am J Ophthalmol. 2023 Oct;254:141–9.
Search in Google Scholar Back to article

Articles in this issue

DOI: https://doi.org/10.2478/ahem-2024-0006 | Journal eISSN: 1732-2693

Journal RSS Feed

Language: English

Page range: 111 - 116

Submitted on: Jan 11, 2024

Accepted on: Jun 19, 2024

Published on: Sep 23, 2024

Published by: Hirszfeld Institute of Immunology and Experimental Therapy

In partnership with: Paradigm Publishing Services

Publication frequency: 1 issue per year

Keywords:

ophthalmology,

ChatGPT,

Polish national specialty exam

Related subjects:

Life sciences,

Molecular biology,

Microbiology and virology,

Medicine,

Basic medical science,

Immunology

© 2024 Marcin Ciekalski, Maciej Laskowski, Agnieszka Koperczak, Maria Śmierciak, Sebastian Sirek, published by Hirszfeld Institute of Immunology and Experimental Therapy
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.

Volume 78 (2024): Issue 1 (January 2024)