Fig. 1.

Fig. 2.

Fig. 3.

Fig. 4.

Fig. 5.

Fig. 6.

Comparison of our method with recent studies on pathological voice detection.
| Study | Dataset | Features and model | Accuracy [%] |
|---|---|---|---|
| [29] | SVD | Multipeak, Gaussian mixture model (GMM) | 91.83 |
| [30] | SVD + HUPA | MFCCs, SVM | 71.45–76.19 |
| [31] | MEEI voice disorders | MFCC (500 ms frames, 5 ms shift), SVM | 66.4–75.1 |
| [32] | SVD + HUPA | wav2vec, SVM | 68.55–83.11 |
| [33] | SVD + HUPA | Mel-spectrogram, SVM | 69.45–75 |
| [34] | VOICED | wav2vec 2.0, SVM / KNN | 98 |
| [35] | UA-speech + TORGO | MFCCs, SVM | 63.13–89.22 |
| This work | SVD | EMD-IMF, Mel-spectrogram + scalogram, AlexNet-CNN | 85.66 / 86.4 |