Evidence-Grounded Decision Support for Aircraft Line Maintenance Using Conformal Prediction and Retrieval-Augmented NLP from Technical Log Records

Arthur Dela Peña; Jefferson Clariza; Mary Ann Aballiar-Vista

doi:10.2478/tar-2026-0009

Transactions on Aerospace Research

Volume 2026 (2026): Issue 2 (June 2026)

Open Access

Jun 2026

DOI: https://doi.org/10.2478/tar-2026-0009 | Journal eISSN: 2545-2835

Language: English

Page range: 53–85

Submitted on: Jan 23, 2026

Accepted on: Mar 16, 2026

Published on: Jun 17, 2026

Published by: ŁUKASIEWICZ RESEARCH NETWORK – INSTITUTE OF AVIATION

In partnership with: Paradigm Publishing Services

Keywords: aircraft maintenance diagnostics, line maintenance triage, JASC code classification, evidence-grounded retrieval, conformal prediction, abstention

Related subjects: Engineering, Introductions and overviews, Materials sciences, other, Physics, Physics, other

© 2026 Arthur Dela Peña, Jefferson Clariza, Mary Ann Aballiar-Vista, published by ŁUKASIEWICZ RESEARCH NETWORK – INSTITUTE OF AVIATION
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.