Figure 1:

Figure 2:

Figure 3:

Figure 4:

Figure 5:

Figure 6:

Figure 7:

Figure 8:

Figure 9:

Figure 10:

Figure 11:

Figure 12:

Figure 13:

Figure 14:

Figure 15:

Figure 16:

Figure 17:

Figure 18:

Figure 19:

Figure 20:

Figure 21:

Research questions_
| Research question No. | Research question | Objective |
|---|---|---|
| 1. | How can the integration of AI-driven methodologies in breast cancer research improve early detection and personalized treatment plans? |
|
| 2. | How does the volume of research publications in ECG, CKD, diabetes, and breast cancer correlate with healthcare challenges and research priorities in different countries? | Interpret how these research patterns might indicate the healthcare challenges faced by these countries. |
| 3. | How does author productivity vary among ECG, diabetes, CKD, and breast cancer research fields according to Lotka’s law? | Determine which field has the highest and lowest percentage of highly productive authors. |
| 4. | What insights can Biblioshiny keyword co-occurrence analysis provide for a specific field of research? | Keyword co-occurrence analysis identifies the most frequent terms in a field, revealing key research themes, emerging trends, and areas of academic focus or future opportunities. |
Inclusion and exclusion criteria_
| Inclusion criteria | Exclusion criteria |
|---|---|
|
|
Literature review from 2020 to 2024_
| Reference | Year | Algorithm implemented | Contribution |
|---|---|---|---|
| [42] | 2024 | 12 different classifiers that belong to six learning strategies were evaluated using two datasets. | Diagnosing diabetes |
| [43] | 2024 | Ten ML classifiers were used | Prediction of diabetes disease through a mobile app |
| [35] | 2024 | Used sensor technology | Work on gestational diabetes |
| [7] | 2023 | Gradient boosting tree, RF, KNN, and SVM | The authors study an automatic arrhythmia classification method for the healthcare system, the MHO algorithm, and the ML classifier. |
| [6] | 2023 | Convolutional neural network | The CNN algorithm is used for feature extraction on the ECG image dataset |
| [44] | 2023 | NB, LR, SVM, KNN, DT, AdaBoost, XGboost | Compared the performance level by using AUC |
| [40] | 2022 | SVM, LR, DT, XGBOOST, RF, AdaBoost | ADAboost provides 100% sensitivity for the detection of CKD |
| [45] | 2022 | SVM, RF, Gradient boosting, Ada boost | Predicting breast cancer based on different medical symptoms |
| [41] | 2021 | RF, XGBOOST, neural network | Evaluated the risk of CKD |
| [10] | 2021 | 8 ML classifiers were used | Performance analysis of CKD |
| [46] | 2021 | SVM, RF, LR, DT, KNN | Breast cancer prediction and diagnosis |
| [47] | 2020 | Six supervised ML algorithms | Prediction of breast cancer using various ML algorithms |
| [9] | 2020 | K-nearest neighbor, LR, RF, SVM, and decision tree | Diabetes prediction |
| [38] | 2020 | Eleven ML classifiers were used | Prediction of CKD |
| [39] | 2020 | Seven ML techniques are utilized | Classifying the kidney patient dataset as CKD or NOTCKD |
| [30] | 2020 | SVM | An SVM classifier is proposed to classify the heartbeat. The result of SVM was compared with other classifiers |
| [29] | 2020 | SVM, KNN, ANN | Construct a biometric recognition system |
Summary of the ML classification and clustering algorithms in terms of accuracy_
| Supervised ML | ||||
|---|---|---|---|---|
| Classification algorithms | References | Year | Task | Accuracy (%) |
| Support vector regression | [18] | 2022 | Predicting and illustrating the COVID-19 pandemic | 94 |
| Decision trees | [19] | 2022 | Prediction of diabetes | 96 |
| Naïve Bayes | [20] | 2020 | Skin disease detection | 94.3 |
| Naïve Bayes | [21] | 2020 | Heart disease detection | 88.16 |
| Ensemble techniques | [22] | 2020 | Predict the normal weekly cost that patients will spend on specific medicines | 98 |
| Decision trees | [23] | 2020 | Heart disease prediction | 88 |
| SVM | [24] | 2019 | Speech recognition, facial recognition | 91.3 |