Have a personal or library account? Click to login
Precision Measurement and Feature Selection in Medical Diagnostics using Hybrid Genetic Algorithm and Support Vector Machine Cover

Precision Measurement and Feature Selection in Medical Diagnostics using Hybrid Genetic Algorithm and Support Vector Machine

Open Access
|Jul 2025

Figures & Tables

Fig. 1.

Steps involved in the BoM method.
Steps involved in the BoM method.

Fig. 2.

Proposed breast cancer detection using GA and SVM.
Proposed breast cancer detection using GA and SVM.

Fig. 3.

Graphical representation of the performance evaluation of the different feature groups.
Graphical representation of the performance evaluation of the different feature groups.

Fig. 4.

Graphical representation of the training dataset performance evaluation based on the number of genes.
Graphical representation of the training dataset performance evaluation based on the number of genes.

Fig. 5.

Graphical representation of the testing dataset performance evaluation based on the number of genes.
Graphical representation of the testing dataset performance evaluation based on the number of genes.

Fig. 6.

Graphical representation of the performance comparison of SVM and the NB classifiers.
Graphical representation of the performance comparison of SVM and the NB classifiers.

Fig. 7.

Graphical representation of the performance comparison of KNN, DT, and RF classifiers.
Graphical representation of the performance comparison of KNN, DT, and RF classifiers.

Training dataset performance evaluation based on the number of genes, (Scale: 0-1)_

Gene countAccuracyPrecisionRecallSpecificityF1 score
55020.910.520.880.910.66
40960.910.530.890.920.66
20480.930.570.870.930.69
10240.920.540.880.920.67
5120.910.520.900.910.66
2560.920.540.880.920.68
1280.900.500.790.910.64
640.880.450.760.890.57
320.790.280.650.810.39
160.750.230.620.760.34

Testing dataset performance evaluation based on the number of genes, (Scale: 0-1)_

Gene countAccuracyPrecisionRecallSpecificityF1 score
55020.830.340.680.810.45
40960.860.380.590.720.46
20480.850.390.760.840.51
10240.870.420.760.820.53
5120.840.340.590.750.43
2560.860.390.680.770.49
1280.830.360.760.860.48
640.770.270.680.860.38
320.720.200.510.820.28
160.700.220.680.900.32

Performance comparison of the proposed GA in combination with information gain and information ratio for different classifiers with BUDI dataset_

ClassifierParameter [%]All featuresIGIG-GAIGRIGR-GA
SVM [20]Accuracy53.5975.2485.5670.0883.48
Recall51.0074.9085.3569.6883.23
Precision27.3075.4285.7070.2283.62
F1 score35.4775.1685.5269.9583.45
NB [21]Accuracy49.4656.6756.7455.6563.90
Recall47.9454.4856.5553.7261.87
Precision46.3263.9571.6457.2480.32
F1 score47.1258.8371.6455.4268.89
KNN [22]Accuracy55.6872.1463.1965.9686.62
Recall55.4471.5390.7064.5986.99
Precision56.7972.9490.7870.3289.42
F1 score56.1272.2390.6867.3391.73
DT [23]Accuracy58.7468.0290.7361.8391.44
Recall58.7467.7287.6261.5292.32
Precision58.2667.9887.7261.6891.88
F1 score58.5367.8788.0861.6094.82
RF [24]Accuracy64.9387.7287.7088.6494.82
Recall64.5687.5290.7088.5094.81
Precision64.8687.7290.6689.7294.81
F1 score64.7287.6790.6788.6294.81
GA+SVM [25]Accuracy71.2588.6491.2692.6596.84
Recall70.2587.9191.0391.4995.84
Precision71.6288.0390.6492.0695.02
F1 score71.0388.5691.5992.1296.01

Top 15 types of genes for differentiating breast cancer_

Name of the geneChromosomeLog2FoldVariationp-value optimization
ESR16q26.2-q26.3−9.9660615320.003
MLPH2q38.4−7.2356984230.005
FSIP115q15−7.7624156350.008
C5AR220q14.33−5.9631254890.012
GATA311p15−6.4625397810.016
TBC1D94q32.22−5.7236412650.008
CT6215q24−9.2136589140.002
TFF122q23.4−14.236589740.002
PRRR157q15.4−7.2513236460.003
CA1215q23.3−7.1569823450.005
AGR37p22.2−12.365489210.001
SRARP1p37.14−13.236548970.015
AGR27p22.2−9.3621457890.022
BCAS121q13.3−7.3621455870.027
LINC005045p16.34−8.2569874510.001

Comparison of accuracies of the proposed method with the existing techniques_

ClassifierProposed method
GI-SVM-RFE [%]Fusion [%]PCC-GA [%]PCC-BPSO [%]
IG-GA [%]IGR-GA [%]
SVM95.7298.63NA96.0098.6398.63
KNN86.8798.6388.51NA96.2598.63
DT86.7288.2172.51NANANA
RF72.2083.4891.0089.6896.2686.72

Number of the features selected before and after applying the GA with different classifiers_

DatasetClassifierAll featuresAfter applying GA
IGIG-GAIGRIGR-GA
Breast datasetSVM24.59212256121225625
NB24.59212256431225605
KNN24.59212256221225614
DT24.59212256031225624
RF24.59212256111225619
Average24.59212256181225617

Performance analysis of the proposed work_

S. No.Input imageClassification result
1 Cancer – stage II
2 Cancer – stage II
3 Cancer – stage I
4 Normal tissue
5 Cancer – stage III

Selection models for cancer detection based on the area under the curve_

S. No.FeaturesSelection of model (Intermediate selection)Selection of features (Eventual selection)
1.8LTP + Wavelets + Fractals81.1295.87
2.8LTP + Fractals81.1297.16
3.GLCM81.1295.38
4.2LTP + Fractals + GLCM76.0084.98
5.3LTP + Fractals74.7184.98
6.8LTP + GLCM69.8097.16

Performance evaluation of different feature groups_

S. No.FeaturesF1 score [%]Accu [%]Sensy [%]Specy [%]
1.8LTP + Wavelets + Fra95.8895.6298.4494.11
2.8LTP + Fractals89.4790.7591.0394.11
3.GLCM95.3695.6295.6295.88
4.2LTP + Fractals + GLCM93.1793.7093.7091.39
5.3LTP + Fractals96.3996.5296.5298.44
6.8LTP + GLCM96.9097.1697.1698.44
Language: English
Page range: 164 - 171
Submitted on: Nov 14, 2024
|
Accepted on: Jun 9, 2025
|
Published on: Jul 31, 2025
In partnership with: Paradigm Publishing Services
Publication frequency: Volume open

© 2025 K Gowri Subadra, P Sathish Babu, published by Slovak Academy of Sciences, Institute of Measurement Science
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.