Have a personal or library account? Click to login
Assessment of Ten Insulin Resistance Surrogate Indexes Predicts New-Onset Cardiovascular Disease Incidence in Patients with Prediabetes or Diabetes: Insights from CHARLS Data with Machine Learning Analysis Cover

Assessment of Ten Insulin Resistance Surrogate Indexes Predicts New-Onset Cardiovascular Disease Incidence in Patients with Prediabetes or Diabetes: Insights from CHARLS Data with Machine Learning Analysis

By: Hang Xie,  Chaoying Yan,  Yi Zheng and  Haoyu Wu  
Open Access
|Mar 2026

Figures & Tables

Table 1

Baseline characteristics of the study population according to CVD.

VARIABLESWITHOUT CVD
(N = 2658)
WITH CVD
(N = 874)
p
HbA1c (mean, SD)5.39 (0.89)5.48 (0.95)0.017
FBG (mean, SD)120.24 (37.68)122.88 (41.46)0.08
UA (mean, SD)4.47 (1.24)4.41 (1.27)0.196
Creatinine (mean, SD)0.78 (0.19)0.77 (0.20)0.686
TG (mean, SD)136.78 (88.28)143.45 (86.23)0.051
TC (mean, SD)197.70 (38.98)199.52 (38.78)0.229
HDL (mean, SD)51.31 (15.40)49.84 (15.15)0.014
CRP (mean, SD)2.70 (7.69)2.93 (7.67)0.438
WC (mean, SD)85.02 (11.15)87.66 (11.90)<0.001
Weight (mean, SD)59.01 (10.97)60.74 (12.06)<0.001
Height (mean, SD)1.58 (0.08)1.57 (0.09)0.072
BUN (mean, SD)16.00 (4.56)15.64 (4.42)0.041
LDL (mean, SD)119.24 (34.92)120.93 (36.74)0.22
Gender (%)
    Female1407 (52.9)523 (59.8)<0.001
    Male1251 (47.1)351 (40.2)
Age (mean, SD)58.32 (8.55)60.74 (8.59)<0.001
BMI (mean, SD)23.63 (3.70)24.48 (3.99)<0.001
SBP (mean, SD)129.08 (20.01)134.15 (21.56)<0.001
DBP (mean, SD)75.21 (11.39)77.26 (12.29)<0.001
Marital status (%)
    Unmarried366 (13.8)152 (17.4)0.01
    Married2292 (86.2)722 (82.6)
Smoking status (%)
    never smoker1640 (61.7)571 (65.3)<0.001
    former smoker201 (7.6)96 (11.0)
    current smoker817 (30.7)207 (23.7)
Drinking status (%)
    never drinker1590 (59.8)544 (62.2)0.01
    former drinker208 (7.8)88 (10.1)
    current drinker860 (32.4)242 (27.7)
Education level (%)
    elementary school or below1891 (71.1)624 (71.4)0.049
    middle school708 (26.6)218 (24.9)
    college or above59 (2.2)32 (3.7)
Rural (%)
    No398 (15.0)168 (19.2)0.004
    Yes2260 (85.0)706 (80.8)
Hypertension (%)
    No1647 (62.0)394 (45.1)<0.001
    Yes1011 (38.0)480 (54.9)
Dyslipidemia (%)
    No2485 (93.5)723 (82.7)<0.001
    Yes173 (6.5)151 (17.3)
Kidney disease (%)
    No2548 (95.9)822 (94.1)0.033
    Yes110 (4.1)52 (5.9)
Liver disease (%)
    No2589 (97.4)843 (96.5)0.176
    Yes69 (2.6)31 (3.5)
Antidiabetic drugs (%)
    No2549 (95.9)804 (92.0)<0.001
    Yes109 (4.1)70 (8.0)
Lipid-lowering agents (%)
    No2570 (96.7)800 (91.5)<0.001
    Yes88 (3.3)74 (8.5)
Antihypertensive drugs (%)
    No2263 (85.1)604 (69.1)<0.001
    Yes395 (14.9)270 (30.9)
eGDR (mean, SD)9.24 (2.20)8.38 (2.34)<0.001
CVAI (mean, SD)96.63 (38.68)109.43 (40.66)<0.001
TyG (mean, SD)8.82 (0.63)8.90 (0.65)0.002
TyG-BMI (mean, SD)209.09 (39.27)218.42 (42.11)<0.001
TyG-WC (mean, SD)7.52 (1.24)7.81 (1.31)<0.001
TyG-WHtR (mean, SD)4.77 (0.80)4.98 (0.84)<0.001
METS-IR (mean, SD)36.19 (7.75)37.99 (8.36)<0.001
AIP (mean, SD)0.86 (0.76)0.95 (0.75)0.002
TyHGB (mean, SD)10.23 (3.49)10.69 (3.80)0.001
CTI (mean, SD)8.89 (0.82)9.00 (0.83)0.001
gh-21-1-1532-g1.png
Figure 1

Receiver operating characteristic (ROC) curves comparing the predictive performance of ten insulin resistance surrogate indices for cardiovascular events. (A) ROC curves for predicting total CVD events. (B) ROC curves for predicting heart disease. (C) ROC curves for predicting stroke. Among all ten insulin resistance surrogate indices evaluated, eGDR consistently showed the highest discriminative ability across all cardiovascular outcomes, while CVAI demonstrated the second-best performance for total CVD and heart disease prediction.

Table 2

Associations of ten insulin resistance surrogate indexes with new-onset cardiovascular disease incidence in patients with prediabetes or diabetes.

MODEL I
OR (95%CI)
P-VALUEMODEL II
OR (95%CI)
P-VALUEMODEL III
OR (95%CI)
P-VALUEMODEL IV
OR (95%CI)
P-VALUE
eGDR (per 1 SD)0.682(0.631–0.737)<0.0010.705(0.651–0.762)<0.0010.809(0.686–0.952)0.01120.822(0.696–0.969)0.021
eGDR quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q20.636(0.519–0.779)<0.0010.646(0.526–0.792)<0.0010.767(0.604–0.972)0.02890.776(0.610–0.985)0.0378
Q30.387(0.310–0.482)<0.0010.426(0.340–0.532)<0.0010.502(0.338–0.747)0.00060.519(0.349–0.776)0.0013
Q40.389(0.312–0.484)<0.0010.418(0.335–0.521)<0.0010.505(0.339–0.753)0.00070.527(0.353–0.789)0.0018
P for trend<0.001<0.0010.00310.0064
CVAI (per 1 SD)1.383(1.281–1.495)<0.0011.324(1.223–1.435)<0.0011.137(1.042–1.241)0.00381.124(1.028–1.229)0.0099
CVAI quartile
    Q11(Reference)1(Reference)1(Reference)1(Reference)
    Q21.33(1.057–1.684)0.01551.225(0.968–1.552)0.09151.157(0.910–1.472)0.2341.134(0.891–1.444)0.3062
    Q31.465(1.165–1.847)0.0021.298(1.026–1.643)0.02961.086(0.851–1.387)0.5091.042(0.815–1.335)0.7415
    Q42.343(1.881–2.927)<0.0012.047(1.635–2.569)<0.0011.383(1.083–1.769)0.00951.331(1.038–1.709)0.0243
P for trend<0.001<0.0010.01960.0483
TyHGB (per 1 SD)1.130(1.050–1.215)0.00091.153(1.070–1.240)0.00021.044(0.962–1.130)0.29811.049(0.955–1.151)0.3169
TyHGB quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q21.324(1.061–1.653)0.01331.330(1.064–1.666)0.01261.234(0.982–1.553)0.07211.203(0.955–1.518)0.1171
Q31.231(0.984–1.541)0.06861.261(1.005–1.583)0.04521.065(0.842–1.347)0.59931.027(0.810–1.304)0.8214
Q41.502(1.207–1.872)0.00031.556(1.247–1.945)<0.0011.144(0.903–1.449)0.26581.100(0.859–1.410)0.4484
P for trend0.00120.00040.5290.753
TyG (per 1 SD)1.125(1.043–1.213)0.00231.131(1.046–1.222)0.001891.031(0.949–1.119)0.46891.029(0.933–1.134)0.5672
TyG quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q21.228(0.983–1.534)0.07061.215(0.971–1.522)0.08881.130(0.898–1.422)0.29831.107(0.878–1.396)0.389
Q31.303(1.046–1.626)0.01871.269(1.014–1.588)0.03751.114(0.885–1.404)0.35761.085(0.858–1.373)0.4956
Q41.381(1.110–1.721)0.00391.397(1.119–1.746)0.00331.091(0.864–1.378)0.46271.053(0.812–1.364)0.6987
P for trend0.00380.003780.52710.7182
TyG-BMI (per 1 SD)1.254(1.163–1.352)<0.0011.305(1.208–1.411)<0.0011.125(1.033–1.225)0.00671.119(1.024–1.222)0.0127
TyG-BMI quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q21.021(0.812–1.283)0.8611.100(0.872–1.389)0.41951.009(0.795–1.279)0.94341.002(0.789–1.273)0.9849
Q31.337(1.072–1.668)0.011.440(1.149–1.807)0.00161.156(0.913–1.465)0.22771.133(0.892–1.441)0.3056
Q41.662(1.340–2.065)<0.0011.901(1.518–2.384)<0.0011.269(0.993–1.624)0.05661.240(0.963–1.598)0.0959
P for trend<0.001<0.0010.03010.0606
TyG-WC (per 1 SD)1.274(1.178–1.379)<0.0011.287(1.189–1.395)<0.0011.113(1.023–1.213)0.01351.108(1.014–1.212)0.0246
TyG-WC quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q21.113(0.887–1.398)0.35341.114(0.885–1.401)0.35861.057(0.836–1.335)0.64181.043(0.824–1.319)0.7281
Q31.233(0.985–1.544)0.06761.251(0.997–1.571)0.05341.039(0.821–1.316)0.74811.005(0.792–1.277)0.9649
Q41.829(1.475–2.273)<0.0011.851(1.488–2.306)<0.0011.279(1.009–1.623)0.0421.247(0.976–1.596)0.078
P for trend<0.001<0.0010.05570.1131
TyG-WHtR (per 1 SD)1.298(1.199–1.404)<0.0011.257(1.159–1.364)<0.0011.093(1.002–1.193)0.04431.087(0.993–1.191)0.0723
TyG-WHtR quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q21.246(0.990–1.570)0.06111.228(0.973–1.551)0.08461.117(0.881–1.419)0.36011.102(0.867–1.402)0.4252
Q31.404(1.119–1.763)0.00341.384(1.098–1.746)0.00591.131(0.889–1.439)0.31481.102(0.865–1.406)0.4323
Q42.049(1.647–2.555)<0.0011.904(1.515–2.396)<0.0011.313(1.026–1.682)0.03061.285(0.996–1.663)0.0544
P for trend<0.001<0.0010.03720.0696
MetS-IR (per 1 SD)1.246(1.156–1.343)<0.0011.304(1.207–1.409))<0.0011.124(1.033–1.223)0.00681.112(1.020–1.213)0.0158
MetS-IR quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q21.034(0.824–1.298)0.7721.089(0.865–1.371)0.46751.027(0.813–1.298)0.82111.007(0.795–1.275)0.955
Q31.250(1.001–1.561)0.04861.367(1.091–1.717)0.00671.130(0.893–1.431)0.30871.097(0.864–1.394)0.4459
Q41.665(1.343–2.068)<0.0011.879(1.505–2.349)<0.0011.278(1.005–1.628)0.04561.233(0.965–1.577)0.0936
P for trend<0.001<0.0010.03340.0728
AIP (per 1 SD)1.126(1.043–1.215)0.00231.144(1.059–1.238)0.00071.046(0.964–1.136)0.27741.038(0.949–1.136)0.4106
AIP quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q21.131(0.904–1.414)0.27981.139(0.909–1.427)0.25791.097(0.872–1.382)0.42961.082(0.856–1.367)0.5095
Q31.416(1.140–1.762)0.00171.433(1.150–1.788)0.00141.235(0.984–1.553)0.06841.200(0.952–1.514)0.1235
Q41.270(1.019–1.583)0.03361.327(1.062–1.661)0.0131.054(0.834–1.331)0.65951.001(0.783–1.279)0.9937
P for trend0.00810.00280.47820.7089
CTI (per 1 SD)1.144(1.060–1.234)0.00051.133(1.049–1.223)0.00151.016(0.936–1.103)0.70011.004(0.917–1.099)0.9281
CTI quartile
Q11(Reference)1(Reference)1(Reference)1(Reference)
Q21.221(0.977–1.528)0.07851.155(0.922–1.447)0.21091.079(0.857–1.359)0.51751.057(0.838–1.333)0.6409
Q31.290(1.034–1.611)0.02431.243(0.994–1.556)0.05671.031(0.817–1.300)0.79790.999(0.789–1.265)0.9978
Q41.474(1.184–1.835)0.00051.416(1.135–1.767)0.0021.079(0.855–1.361)0.52311.037(0.808–1.332)0.7741
P for trend0.00060.00170.63520.9036

[i] Model I was unadjusted; Model II included adjustments for gender and age; and Model III contained additional adjustments for marital status, smoking status, drinking status, education level, residence, hypertension, dyslipidemia, kidney disease, liver disease, antidiabetic drugs, lipid-lowering agents, antihypertensive drugs. Model IV further adjusted for TC, LDL, BUN, Creatinine, SBP, and DBP.

gh-21-1-1532-g2.png
Figure 2

Subgroup analyses of the associations between insulin resistance indices and incident cardiovascular disease. (A) Forest plot showing the association between Chinese Visceral Adiposity Index (CVAI) and CVD risk across different demographic and clinical subgroups. (B) Forest plot illustrating the association between estimated glucose disposal rate (eGDR) and CVD risk across subgroups. No significant interactions were observed between either index and demographic or clinical characteristics (all P for interaction > 0.05), suggesting consistent associations across different subgroups.

gh-21-1-1532-g3.png
Figure 3

Dose-response relationships between insulin resistance indices and risk of incident cardiovascular disease. (A) Restricted cubic spline analysis demonstrating the relationship between Chinese Visceral Adiposity Index (CVAI) and the odds ratio of new-onset cardiovascular disease. (B) Restricted cubic spline analysis illustrating the relationship between estimated glucose disposal rate (eGDR) and the odds ratio of new-onset cardiovascular disease.

gh-21-1-1532-g4.png
Figure 4

Feature selection for cardiovascular disease risk prediction using machine learning approaches. (A) The Boruta algorithm identified 13 features as important predictors of CVD risk. (B) Feature selection using LASSO (Least Absolute Shrinkage and Selection Operator) regression with ten-fold cross-validation. The plot shows the binomial deviance against log(λ), where λ is the regularization parameter. The numbers above the plot indicate the number of non-zero coefficients retained in the model at each value of λ. The optimal model using the minimum lambda criterion identified 15 key variables. (C) Venn diagram showing the intersection of features selected by both Boruta and LASSO methods, resulting in 10 final variables (gender, education, dyslipidemia, hypertension, UA, WC, BUN, DBP, age, and BMI) that were subsequently used in the machine learning models for predicting CVD incidence.

gh-21-1-1532-g5.png
Figure 5

Performance comparison of nine machine learning algorithms for predicting cardiovascular disease incidence. The receiver operating characteristic (ROC) curves illustrate the discriminative ability of each model in identifying individuals at risk of developing cardiovascular disease. The K-Nearest Neighbors (KNN) algorithm demonstrated markedly superior performance with the highest area under the curve (AUC) of 0.9315 (95% CI: 0.924–0.939), indicating excellent discrimination. All models were trained using the final set of 10 selected features identified through the combined Boruta and LASSO feature selection process.

gh-21-1-1532-g6.png
Figure 6

Incremental predictive value of insulin resistance indices when added to the baseline model for cardiovascular disease prediction. This forest plot demonstrates the improvement in discriminative ability when eGDR and CVAI were incorporated into the K-Nearest Neighbors (KNN) algorithm, which showed superior performance among all machine learning approaches.

gh-21-1-1532-g7.png
Figure 7

Comprehensive evaluation of performance metrics across nine machine learning models incorporating eGDR and CVAI for cardiovascular disease prediction. This multi-panel figure displays six key performance metrics for each model: (A) Accuracy; (B) Precision; (C) Recall; (D) F1 score; (E) Specificity; and (F) Sensitivity.

DOI: https://doi.org/10.5334/gh.1532 | Journal eISSN: 2211-8179
Language: English
Submitted on: May 29, 2025
|
Accepted on: Feb 20, 2026
|
Published on: Mar 12, 2026
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2026 Hang Xie, Chaoying Yan, Yi Zheng, Haoyu Wu, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.