Have a personal or library account? Click to login
Disparity of Imputed Data from Small Area Estimate Approaches – A Case Study on Diabetes Prevalence at the County Level in the U.S. Cover

Disparity of Imputed Data from Small Area Estimate Approaches – A Case Study on Diabetes Prevalence at the County Level in the U.S.

Open Access
|Apr 2018

Figures & Tables

dsj-17-763-g1.jpg
Figure 1

Geographic distribution of the number of diagnosed diabetes cases in the Behavioral Risk Factor Surveillance Survey among 3,109 U.S. counties in 2012. The color pattern was categorized by the quartiles of the number of diagnosed diabetes.

Table 1

Summary table of diabetes in the U.S.

Diagnosed diabetesP-value*
No
N (%)
Yes
N (%)
Age<.0001
   18–45105446 (96.65%)3650 (3.35%)
   45–64140475 (86.91%)21159 (13.09%)
   65+105983 (80.03%)26440 (19.97%)
Sex<.0001
   Male140905 (86.67%)21673 (13.33%)
   Female214109 (87.72%)29962 (12.28%)
Race<.0001
   Non-Hispanic White282189 (88.27%)37485 (11.73%)
   Non-Hispanic Black29536 (80.34%)7227 (19.66%)
   Hispanic22269 (86.33%)3525 (13.67%)
   Others16710 (86.55%)2596 (13.45%)

[i] * Chi-square test, where 7519 missing values are excluded.

dsj-17-763-g2.jpg
Figure 2

Comparing three SAEs of diabetes prevalence at the county level using quintiles.

dsj-17-763-g3.jpg
Figure 3

Scatter plots of small area estimates among Models 1, 2 and 3.

Table 2

Descriptive statistics of diabetes prevalence estimates from the three SAE models.

ModelMeanSDMinimumQ1MedianQ3MaximumFP-value
All counties (N = 3,109)
10.11520.02370.05080.09860.11210.12820.2402910.22<.0001
20.10420.02380.03350.08700.10240.11930.2171
30.09020.02200.03420.03730.08550.10230.1789
Counties with samples in the BRFSS (N = 2,225)
10.11460.02220.05360.09940.11240.12720.2210377.93<.0001
20.10340.02330.04050.08680.10240.11870.1940
30.09600.02260.03510.08000.09310.10970.1789
Counties without samples in the BRFSS (N = 884)
10.11670.02710.05100.09690.11050.13110.2402831.05<.0001
20.10630.02500.03350.08750.10230.12080.2171
30.07550.01050.03420.06870.07360.08060.1337

[i] Abbreviation: SD = Standard deviation; Q1 = The first quartile; Q3 = The third quartile

† The p-values were calculated from the analysis of variation.

Table 3

Mean difference comparison in the SAE of diabetes prevalence among Models 1, 2 and 3.

ComparisonDifference95% CI
All counties (N = 3,109)
Model 1 vs. Model 20.0110(0.0096, 0.0124)
Model 1 vs. Model 30.0250(0.0237, 0.0264)
Model 2 vs. Model 30.0140(0.0127, 0.0154)
Counties with samples in the BRFSS (N = 2,225)
Model 1 vs. Model 20.0112(0.0096, 0.0128)
Model 1 vs. Model 30.0186(0.0170, 0.0202)
Model 2 vs. Model 30.0073(0.0057, 0.0089)
Counties without samples in the BRFSS (N = 884)
Model 1 vs. Model 20.0104(0.0079, 0.0129)
Model 1 vs. Model 30.0413(0.0388, 0.0438)
Model 2 vs. Model 30.0309(0.0284, 0.0333)
dsj-17-763-g4.jpg
Figure 4

The observer agreement charts of categorized small area estimates among Models 1, 2 and 3.

Language: English
Submitted on: Nov 27, 2017
Accepted on: Mar 9, 2018
Published on: Apr 6, 2018
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2018 Lung-Chang Chien, Ge Lin, Xiao Li, Xingyou Zhang, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.