Have a personal or library account? Click to login
Identification of women with high grade histopathology results after conisation by artificial neural networks Cover

Identification of women with high grade histopathology results after conisation by artificial neural networks

Open Access
|Aug 2022

Figures & Tables

Figure 1

Schematic of simple neural network with input, output and three hidden layers.
Schematic of simple neural network with input, output and three hidden layers.

Figure 2

Matthews correlation coefficient (MCC) for categorisation squamous intraepithelial lesion (HSIL)-combined for YES and NO prediction for different equalisation methods (no correction of minority class, under-sampling, oversampling and synthetic minority over-sampling technique [SMOTE]) for both RAW and Class settings. Best performance of multi-layer perceptron (MLP) is on dataset with data organised in classes and over-sampling method for minority class – MCC = 0.64. Lowest performance is with original dataset without correction for minority class – MCC = 0.086.
Matthews correlation coefficient (MCC) for categorisation squamous intraepithelial lesion (HSIL)-combined for YES and NO prediction for different equalisation methods (no correction of minority class, under-sampling, oversampling and synthetic minority over-sampling technique [SMOTE]) for both RAW and Class settings. Best performance of multi-layer perceptron (MLP) is on dataset with data organised in classes and over-sampling method for minority class – MCC = 0.64. Lowest performance is with original dataset without correction for minority class – MCC = 0.086.

Figure 3

True positive and False positive rate for different settings for prediction Yes and No combined and for different equalisation methods (no correction of minority class, under-sampling, over-sampling and synthetic minority over-sampling technique [SMOTE]) for both RAW and Class settings. Best performance model from Figure 2 has 0.842 true positive rate and 0.182 false positive rate. Lowest performance model from Figure 2 has high 0.814 true positive rate which is almost as high as best performance model but also high false positive rate 0.735.
Raw = original settings; Class = class setting; FPR = false positive rate; HSIL = high grade squamous intraepithelial lesion; overs = oversampling; TPR = true positive rate; unders = undersampling; SMOTE = synthetic minority over-sampling technique
True positive and False positive rate for different settings for prediction Yes and No combined and for different equalisation methods (no correction of minority class, under-sampling, over-sampling and synthetic minority over-sampling technique [SMOTE]) for both RAW and Class settings. Best performance model from Figure 2 has 0.842 true positive rate and 0.182 false positive rate. Lowest performance model from Figure 2 has high 0.814 true positive rate which is almost as high as best performance model but also high false positive rate 0.735. Raw = original settings; Class = class setting; FPR = false positive rate; HSIL = high grade squamous intraepithelial lesion; overs = oversampling; TPR = true positive rate; unders = undersampling; SMOTE = synthetic minority over-sampling technique

Figure 4

Receiver operator characteristic (ROC) curve for multi-layer perceptron (MLP) performance on dataset without grouping in classes and no correction for minority class where X axis represent 1- specificity (false positive rate) and Y axis represents sensitivity (true positive rate). Area under the ROC curve (AUC) = 0.594. AUC for categorisation with random guessing is 0.5. This Figure represents model with lowest performance of MLP from our study.
Receiver operator characteristic (ROC) curve for multi-layer perceptron (MLP) performance on dataset without grouping in classes and no correction for minority class where X axis represent 1- specificity (false positive rate) and Y axis represents sensitivity (true positive rate). Area under the ROC curve (AUC) = 0.594. AUC for categorisation with random guessing is 0.5. This Figure represents model with lowest performance of MLP from our study.

Figure 5

Receiver operator characteristic (ROC) curve for multi-layer perceptron (MLP) performance on dataset with patients grouping in classes and synthetic minority over-sampling technique (SMOTE) correction for minority class where X axis represent 1- specificity (false positive rate) and Y axis represents sensitivity (true positive rate). Area under the ROC curve (AUC) = 0.802 which is well above classification with random guessing where AUC is 0.5. This Figure represents best performance model of MLP from our study.
Receiver operator characteristic (ROC) curve for multi-layer perceptron (MLP) performance on dataset with patients grouping in classes and synthetic minority over-sampling technique (SMOTE) correction for minority class where X axis represent 1- specificity (false positive rate) and Y axis represents sensitivity (true positive rate). Area under the ROC curve (AUC) = 0.802 which is well above classification with random guessing where AUC is 0.5. This Figure represents best performance model of MLP from our study.

Number and percentage of patients according to human papilloma virus (HPV) 16 and 18 statuses in high grade squamous intraepithelial lesion (HSIL) and NO-HSIL group

HPV 16HPV 18

HSIL groupNO-HSIL groupHSIL groupNO-HSIL group

Frequency%Frequency%Frequency%Frequency%
not performed177142916172132715
negative69354106577756012065
positive419325127342273920
Total12891001861001289100186100

Results of multi-layer perceptron (MLP) classifications for different settings with baseline prediction – ZeroR, percentage of correct classification and Kappa statistic for all analysis_ Results are for prediction high grade squamous intraepithelial lesion (HSIL)-Yes (Y), prediction NO-HSIL (N) and weighted average for whole model (YES and NO combined) – Weighted average (AVG)_ In bold-type letters are results, where prediction by MLP is better than baseline prediction ZeroR

TP RateFP RatePrecisionRecallF-MeasureMCCROC AreaPRC AreaClass% CorrectKappaZeroR %
Class_orig–Y0.7510.6340.7390.7510.7450.1180.5670.735Yes82.100.096587.39
Class_orig–N0.3660.2490.3080.3660.3730.1180.5670.377No
Class_orig–AVG0.6370.5210.6330.6370.6350.1180.5670.629Weighted Avg
Class_overs–Y0.8600.2010.9080.8600.8840.6400.8700.920Yes84.190.637669.79
Class_overs–N0.7990.1400.7120.7990.7530.6400.8700.703No
Class_overs–AVG0.8420.1820.8490.8420.8440.6400.8700.855Weighted Avg
Class_SMOTE–Y0.7970.2740.8340.7970.8150.5150.8020.850Yes77.080.514163.40
Class_SMOTE–N0.7260.2030.6730.7260.6990.5150.8020.669No
Class_SMOTE–AVG0.7710.2480.7750.7710.7720.5150.8020.784Weighted Avg
Class_unders–Y0.6690.5590.6360.6690.6520.1120.5420.608Yes57.640.111359.39
Class_unders–N0.4410.3310.4770.4410.4580.1120.5420.448No
Class_unders–AVG0.5760.4660.5720.5760.5730.1120.5420.543Weighted Avg
RAW_orig–Y0.9070.8280.8840.9070.8950.0860.5940.905Yes81.420.085687.39
RAW_orig–N0.1720.0930.2110.1720.1890.0860.5940.174No
RAW_orig–AVG0.8140.7350.7990.8140.8060.0860.5940.813Weighted Avg
RAW_overs–Y0.8250.2850.8700.8250.8470.5250.8370.905Yes79.210.52369.79
RAW_overs–N0.7150.1750.6390.7150.6750.5250.8370.661No
RAW_overs–AVG0.7920.2520.8000.7920.7950.5250.8370.831Weighted Avg
RAW_SMOTE–Y0.8000.2580.8430.8000.8210.5330.8140.867Yes77.870.531863.4
RAW_SMOTE–N0.7420.2000.6810.7420.7100.5330.8140.691No
RAW_SMOTE–AVG0.7790.2370.7840.7790.7800.5330.8140.802Weighted Avg
RAW_unders–Y0.6880.5750.6360.6880.6610.1150.5510.614Yes58.080.114459.39
RAW_unders–N0.4250.3130.4820.4250.4510.1150.5510.466No
RAW_unders–AVG0.5810.4690.5730.5810.5760.1150.5510.554Weighted Avg

Confusion matrix for classification with all possible outcomes

Predicted pos (PP)Predicted neg (PN)
Actual pos (P)True positives (TP)False negatives (FN)
Actual neg (N)False positives (FP)True negatives (TN)

Final histology of the cone in patients without human papilloma virus (HPV) testing

FrequencyPercent
NO dysplasia91.8
CIN 1265.3
CIN 1–2275.4
CIN 29018.1
CIN 2–35511.1
CIN 322345.0
CIS5511.1
invasive ca112.2
Total496100.0
DOI: https://doi.org/10.2478/raon-2022-0023 | Journal eISSN: 1581-3207 | Journal ISSN: 1318-2099
Language: English
Page range: 355 - 364
Submitted on: Jan 16, 2022
|
Accepted on: Apr 25, 2022
|
Published on: Aug 14, 2022
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2022 Marko Mlinaric, Miljenko Krizmaric, Iztok Takac, Alenka Repse Fokter, published by Association of Radiology and Oncology
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.