Evidence-Grounded Decision Support for Aircraft Line Maintenance Using Conformal Prediction and Retrieval-Augmented NLP from Technical Log Records
References
- Stanton I, Munir K, Ikram A, El-Bakry M. Predictive maintenance analytics and implementation for aircraft: Challenges and opportunities. Syst Eng. 2023;26(2):216–237. https://doi.org/10.1002/sys.21651
- Scott MJ, Verhagen WJC, Bieber MT, Marzocca P. A systematic literature review of predictive maintenance for defence fixed-wing aircraft sustainment and operations. Sensors. 2022;22(18). https://doi.org/10.3390/s22187070
- Shaukat S, Katscher M, Wu C-L, Delgado F, Larraín H. Aircraft line maintenance scheduling and optimisation. J Air Transp Manag. 2020;89:101914. https://doi.org/10.1016/j.jairtraman.2020.101914
- Kála M, Lališ A, Vojtěch T. Analyzing aircraft maintenance findings with natural language processing. Transp Res Procedia. 2022;65:238–245. https://doi.org/10.1016/j.trpro.2022.11.028
- Garcia J, Rios-Colque L, Peña A, Rojas L. Condition monitoring and predictive maintenance in industrial equipment: An NLP-assisted review of signal processing, hybrid models, and implementation challenges. Appl Sci. 2025;15(10):5465. https://doi.org/10.3390/app15105465
- Dangut MD, Jennions IK, King S, Skaf Z. A rare failure detection model for aircraft predictive maintenance modelling. Neural Comput Appl. 2023;35:2991–3009. https://doi.org/10.1007/s00521-022-07167-8
- Kostopoulos G, Davrazos G, Kotsiantis S. Explainable artificial intelligence-based decision support systems: A recent review. Electronics. 2024;13(14):2842. https://doi.org/10.3390/electronics13142842
- Pelosi D, Cacciagrano D, Piangerelli M. Explainability and interpretability in concept and data drift. Algorithms. 2025;18(7):443. https://doi.org/10.3390/a18070443
- Heredia Álvaro JA, Barreda JG. An advanced retrieval-augmented generation system for manufacturing quality control. Adv Eng Inform. 2025 Mar;64:103007. https://doi.org/10.1016/j.aei.2024.103007
- Ludwig H, Schmidt T, Kühn M. An ontology-based retrieval augmented generation procedure for a voice-controlled maintenance assistant. Computers in Industry. 2025;159:104289. https://doi.org/10.1016/j.compind.2025.104289
- Mortier T, Wydmuch M, Dembczyński K, Hüllermeier E, Waegeman W. Efficient set-valued prediction in multi-class classification. Data Mining and Knowledge Discovery. 2021;35(4):1435–1469. https://doi.org/10.1007/s10618-021-00751-x
- Johansson U, Löfström T, Sönströd C, Löfström H. Conformal prediction for accuracy guarantees in classification with reject option. In: International Conference on Modeling Decisions for Artificial Intelligence (MDAI 2023); 2023. p. 133–145. https://doi.org/10.1007/978-3-031-33498-6_9
- Artelt A, Visser R, Hammer B. “I do not know! but why?” — Local model-agnostic example-based explanations of reject. Neurocomputing. 2023;558:126722. https://doi.org/10.1016/j.neucom.2023.126722
- Campos MM, Farinhas A, Zerva C, Figueiredo MAT, Martins AFT. Conformal prediction for natural language processing: A survey. Trans Assoc Comput Linguist. 2024;12:1497–1516. https://doi.org/10.1162/tacl_a_00715
- Bhardwaj AS, Veeramani D, Zhou S. Confidently extracting hierarchical taxonomy information from unstructured maintenance records of industrial equipment. Int J Prod Res. 2023;61(23):8159–8178. https://doi.org/10.1080/00207543.2023.2167013
- Nanyonga A, Joiner K, Turhan U, Wild G. Does the choice of topic modeling technique impact the interpretation of aviation incident reports? A methodological assessment. Technologies. 2025;13(5) :209. https://doi.org/10.3390/technologies13050209
- Mohd Sharif MM, Maskat R, Baharum Z, Maskat K. A scoping review of topic modelling on online data. Indones J Electr Eng Comput Sci. 2023;31(3):1633–1641. https://doi.org/10.11591/ijeecs.v31.i3.pp1633-1641
- Blanchy G, Albrecht L, Koestel J, Garré S. Potential of natural language processing for metadata extraction from environmental scientific publications. SOIL. 2023;9:155–168. https://doi.org/10.5194/soil-9-155-2023
- Lam BD, Chrysafi P, Chiasakul T, Khosla H, Karagkouni D, McNichol M, Adamski A, Reyes N, Abe K, Mantha S, Vlachos IS, Zwicker JI, Patell R. Machine learning natural language processing for identifying venous thromboembolism: Systematic review and meta-analysis. Blood Adv. 2024;8(12):2991–3000. https://doi.org/10.1182/bloodadvances.2023012200
- Szrama S. Optimizing aircraft engine longevity: A comparative framework for dynamically adaptive predictive maintenance using autoencoders, LSTMs, and Gaussian processes. Eng Appl Artif Intell. 2025;156:111199. https://doi.org/10.1016/j.engappai.2025.111199
- Szrama S, Lodygowski T. Turbofan engine health status prediction with artificial neural network. Aviation. 2024;28(4):225–234. https://doi.org/10.3846/aviation.2024.22554
- Park S, Kwon N, Ahn Y. Forecasting repair schedule for building components based on case-based reasoning and fuzzy-AHP. Sustainability. 2019;11(24):7181. https://doi.org/10.3390/su11247181
- Wu H, Zhong B, Medjdoub B, Xing X, Jiao L. An ontological metro accident case retrieval using CBR and NLP. Appl Sci. 2020;10(15):5298. https://doi.org/10.3390/app10155298
- Bousdekis A, Lepenioti K, Apostolou D, Mentzas G. A review of data-driven decision-making methods for Industry 4.0 maintenance applications. Electronics. 2021;10(7):828. https://doi.org/10.3390/electronics10070828
- Peng B, Zhu Y, Liu Y, Bo X, Shi H, Hong C, Zhang Y, Tang S. Graph retrieval-augmented generation: A survey. ACM Trans Inf Syst. 2025;44(2):Article 35, 1–52. https://doi.org/10.1145/3777378
- Rojas L, Hernandez B, Garcia J. A systematic review of intelligent agents, language models, and recurrent neural networks in industrial maintenance: Driving value creation for the mining sector. Int J Intell Syst. 2024;2025(1):9953223. https://doi.org/10.1155/int/9953223
- Wang B, Wu J, Shi Y, et al. Structured reflective reasoning for precise medical knowledge graph retrieval augmented generation. Health Inf Sci Syst. 2025;13(1):76. https://doi.org/10.1007/s13755-025-00390-2
- Lodygowski T, Szrama S. Unsupervised classification and remaining useful life prediction for turbofan engines using autoencoders and Gaussian mixture models: A comprehensive framework for predictive maintenance. Appl Sci. 2025; 15(14): 7884. https://doi.org/10.3390/app15147884
- Szrama S, Szymański G, Mokrzan D. Aircraft propulsion health status prognostics and prediction. Adv Sci Technol Res. 2025;19(5):321–335. https://doi.org/10.12913/22998624/202232
- Kompa B, Snoek J, Beam AL. Second opinion needed: Communicating uncertainty in medical machine learning. NPJ Digit Med. 2021;4(1):4. https://doi.org/10.1038/s41746-020-00367-3
- Nemani V, Biggio L, Huan X, Hu Z, Fink O, Tran A, et al. Uncertainty quantification in machine learning for engineering design and health prognostics: A tutorial. Mech Syst Signal Process. 2023;205:110796. https://doi.org/10.1016/j.ymssp.2023.110796
- Loftus TJ, Shickel B, Ruppert MM, Balch JA, Ozrazgat-Baslanti T, Tighe PJ, Efron PA, Hogan WR, Rashidi P, Upchurch GR, Bihorac A. Uncertainty-aware deep learning in healthcare: A scoping review. PLOS Digit Health. 2022;1(8):e0000085. https://doi.org/10.1371/journal.pdig.0000085
- Singh Y, Hathaway QA, Keishing V, Salehi S, Wei Y, Horvat N, Vera-Garcia DV, Choudhary A, Mula Kh A, Quaia E, et al. Beyond post hoc explanations: A comprehensive framework for accountable AI in medical imaging through transparency, interpretability, and explainability. Bioengineering. 2025;12(8):879. https://doi.org/10.3390/bioengineering12080879
- de Gelder E, Op den Camp O. How certain are we that our automated driving system is safe? Traffic Inj Prev. 2023;24(sup1):S131–S140. https://doi.org/10.1080/15389588.2023.2186733
- Convery O, Smith L, Gal Y, Hanuka A. Uncertainty quantification for virtual diagnostic of particle accelerators. Phys Rev Accel Beams. 2021;24(7):074602. https://doi.org/10.1103/PhysRevAccelBeams.24.074602
- dos Santos Silva GF, Barcellos Filho FN, Wichmann RM, da Silva Junior FC, Porto Chiavegatto Filho AD. Strategies for detecting and mitigating dataset shift in machine learning for health predictions: A systematic review. J Biomed Inform. 2025;170:104902. https://doi.org/10.1016/j.jbi.2025.104902
- Guo LL, Pfohl SR, Fries J, Posada J, Fleming SL, Aftandilian C, Shah N, Sung L. Systematic review of approaches to preserve machine learning performance in the presence of temporal dataset shift in clinical medicine. Appl Clin Inform. 2021; 12(4):808–815. https://doi.org/10.1055/s-0041-1735184
- Cabanillas Silva P, Sun H, Rezk M, Roccaro-Waldmeyer DM, Fliegenschmidt J, Hulde N, von Dossow V, Meesseman L, Depraetere K, Stieg J, Szymanowsky R, Dahlweid FM. Longitudinal model shifts of machine learning-based clinical risk prediction models: Evaluation study of multiple use cases across different hospitals. J Med Res. 2024;26:e51409. https://doi.org/10.2196/51409
- Sáez C, Gutiérrez-Sacristán A, Kohane I, García-Gómez JM, Avillach P. EHRtemporalVariability: Delineating temporal dataset shifts in electronic health records. GigaScience. 2020;9(8):giaa079. https://doi.org/10.1093/gigascience/giaa079
- Allgaier J, Pryss R. Practical approaches in evaluating validation and biases of machine learning applied to mobile health studies. Commun Med. 2024;4(1):76. https://doi.org/10.1038/s43856-024-00468-0
- Gardner AL, Charlesworth M. How to write a retrospective observational study. Anaesthesia. 2023;78:521–525. https://doi.org/10.1111/anae.15831
- Ghaferi AA, Schwartz TA, Pawlik TM. STROBE reporting guidelines for observational studies. JAMA Surg. 2021;156(6):577–578. https://doi.org/10.1001/jamasurg.2021.0528
- Negash B, Katz A, Neilson CJ, Moni M, Nesca M, Singer A, Enns JE. Deidentification of free text data containing personal health information: A scoping review of reviews. Int J Popul Data Sci. 2023;8(1). https://doi.org/10.23889/ijpds.v8i1.2153
- Lulamba TE, Mutemaringa T, Tiffin N. Ten quick tips for protecting health data using de-identification and perturbation of structured datasets. PLOS Comput Biol. 2025;21(9):e1013507. https://doi.org/10.1371/journal.pcbi.1013507
- Usuga-Cadavid JP, Lamouri S, Grabot B, Fortin A. Using deep learning to value free form text data for predictive maintenance. Int J Prod Res. 2022;60(14): 4548–4575. https://doi.org/10.1080/00207543.2021.1951868
- Opitz J. A closer look at classification evaluation metrics and a critical reflection of common evaluation practice. Trans Assoc Comput Linguist. 2024;12:820–836. https://doi.org/10.1162/tacl_a_00675
- Valcarce D, Bellogín A, Parapar J, Castells P. Assessing ranking metrics in top-N recommendation. Inf Retr J. 2020;23:411–448. https://doi.org/10.1007/s10791-020-09377-x
- Heil BJ, Hoffman MM, Markowetz F, Lee SI, Greene CS, Hicks SC. Reproducibility standards for machine learning in the life sciences. Nat Methods. 2021;18(10):1132–1135. https://doi.org/10.1038/s41592-021-01256-7
- Pineau J, Vincent-Lamarre P, Sinha K, Larivière V, Beygelzimer A, d’Alché-Buc F, Fox E, Larochelle H. Improving reproducibility in machine learning research (a report from the NeurIPS 2019 reproducibility program). J Mach Learn Res. 2021;22(164):1–20. https://doi.org/10.48550/arXiv.2003.12206
DOI: https://doi.org/10.2478/tar-2026-0009 | Journal eISSN: 2545-2835
Language: English
Page range: 53 - 85
Submitted on: Jan 23, 2026
Accepted on: Mar 16, 2026
Published on: Jun 17, 2026
Published by: ŁUKASIEWICZ RESEARCH NETWORK – INSTITUTE OF AVIATION
In partnership with: Paradigm Publishing Services
Keywords:
Related subjects:
© 2026 Arthur Dela Peña, Jefferson Clariza, Mary Ann Aballiar-Vista, published by ŁUKASIEWICZ RESEARCH NETWORK – INSTITUTE OF AVIATION
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.