Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach

Daniela Marella

doi:10.2478/jos-2023-0008

References

Agresti, A. 2007. An Introduction to Categorical Data Analysis (second edition). John Wiley & Sons, Inc., Hoboken: New Jersey.
Search in Google Scholar Back to article
Babu, G.J., and C.R. Rao. 2004. “Goodness-of-Fit Tests when Parameters are Estimated.” Sankhyā. Series A 66(1): 63–74. DOI: https://doi.org/10.2307/25053332.
Search in Google Scholar Back to article
Banca d’Italia. 2012. Supplement to the Statistical Bulletin, Sample Surveys, Household income and wealth in 2010: 12(6). Available at: https://www.bancaditalia.it/pubblicazioni/indagine-famiglie/bil-fam2010.
Search in Google Scholar Back to article
Beaumont, J.F. 2000. “An Estimation Method for Nonignorable Nonresponse.” Survey Methodology 26(2): 131–136. Available at: https://www150.statcan.gc.ca/n1/en/pub/12-001-x/2000002/article/5532-eng.pdf?st=WJWdN_3l.
Search in Google Scholar Back to article
Belzile, L., J.L. Wadsworth, P.J. Northrop, S.D. Grimshaw, J. Zhang, M.A. Stephens, A.B. Owen, and R. Huser. 2022. mev: Modelling Extreme Values. R package version 1.14 Available at: https://CRAN.R-project.org/package=mev (accessed June 2022).
Search in Google Scholar Back to article
Beresewicz, M., R. Lehtonen, F. Reis, L. Di Consiglio and M. Karlberg. 2018. An overview of methods for treating selectivity in big data sources. Statistical Working Papers, Eurostat. Available at: https://ec.europa.eu/eurostat/web/products-statistical-working-papers/-/ks-tc-18-004 (accessed July 2022).
Search in Google Scholar Back to article
Chang, T., and P.S. Kott. 2008. “Using Calibration Weighting to Adjust for Nonresponse under a Plausible Model.” Biometrika 95(3): 555–571. DOI: https://doi.org/10.1093/-biomet/asn022.
Search in Google Scholar Back to article
Chaudhuri S., M.S. Handcock, and M.S. Rendall. 2010. A conditional empirical likelihood approach to combine sampling design and population level information. Technical report No. 3/2010, National University of Singapore, Singapore. Available at: https://cpb-us-w2.wpmucdn.com/blog.nus.edu.sg/dist/0/14452/files/2020/10/tr032010.pdf (accessed July 2022).
Search in Google Scholar Back to article
Chen, C., M. Wang, R. Wu, and R. Li. 2022. “A Robust Consistent Information Criterion for Model Selection Based on Empirical Likelihood.” Statistica Sinica 32: 1205–1223. DOI: https://doi.org/10.5705/ss.202020.0254.
Search in Google Scholar Back to article
Conti P.L., D. Marella, and M. Scanu. 2008. “Evaluation of matching noise for imputation techniques based on nonparametric local linear regression estimators.” Computational Statistics & Data Analysis 53(2): 354–365. DOI: https://doi.org/10.1016/j.csda.2008.07.041.
Search in Google Scholar Back to article
DiSogra, C., C. Cobb, E. Chan, and J. M. Dennis. 2011. “Calibrating non-probability internet samples with probability samples using early adopter characteristics.” In Proceedings of the Section on Survey Research Methods, Joint Statistical Meetings. Miami Beach, Florida, July 30-August 4, 2011: 4501–4515. Alexandria, VA: American Statistical Association. Available at: http://www.asasrms.org/Proceedings/y2011/Files/30270468925.pdf (accessed June 2022).
Search in Google Scholar Back to article
Elliott, M., and R. Valliant. 2017. “Inference for non-probability samples.” Statistical Science 32(2): 249–264. DOI: https://doi.org/10.1214/16-STS598.
Search in Google Scholar Back to article
Feder, M., and D. Pfeffermann. 2019. Statistical Inference Under Non-ignorable Sampling and Non-response. An Empirical Likelihood Approach. Working paper. University of Southampton. Available at: https://eprints.soton.ac.uk/378245/ (accessed July 2022).
Search in Google Scholar Back to article
Galimard J.E., S. Chevret, E. Curis, and M. Resche-Rigon. 2018. “Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors.” BMC Medical Research Methodology 18(90). DOI: https://doi.org/10.1186/s12874-018-0547-1.
Search in Google Scholar Back to article
Hájek, J. 1964. “Asymptotic theory of rejective sampling with varying probabilities from a finite population.” The Annals of Mathematical Statistics 35(4): 1491–1523. DOI: 10.1214/aoms/1177700375.
Open DOI Search in Google Scholar Back to article
Heckman, J.J. 1979. “Sample Selection Bias as a Specification Error.” Econometrica 47(1): 153–161. DOI: http://dx.doi.org/10.2307/1912352.
Search in Google Scholar Back to article
Kim, J.K., and Z. Wang. 2019. “Sampling techniques for big data analysis in finite population inference.” International Statistical Review 87(S1): S177–S191. DOI: https://doi.org/10.1111/insr.12290.
Search in Google Scholar Back to article
Kott, P.S. 2006. “Using calibration weighting to adjust for nonresponse and coverage errors.” Survey Methodology 32(2): 133–142. Available at: https://www150.statcan.gc.ca/n1/en/pub/12-001-x/2006002/article/9547-eng.pdf?st=B2aZNvo0.
Search in Google Scholar Back to article
Kott, P.S., and T. Chang. 2010. “Using Calibration Weighting to Adjust for Nonignorable Unit Nonresponse.” Journal of the American Statistical Association 105(491): 1265–1275. DOI: https://doi.org/10.1198/jasa.2010.tm09016.
Search in Google Scholar Back to article
Lee, J., and J.O. Berger. 2001. “Semiparametric Bayesian analysis of selection models.” Journal of the American Statistical Association 96(456): 1397–1409. DOI: https://doi.org/10.1198/016214501753382318.
Search in Google Scholar Back to article
Marella D., M. Scanu, and P.L. Conti. 2008. “On the matching noise of some nonparametric imputation procedures.” Statistics & Probability Letters 78(12): 1593–1600. DOI: https://doi.org/10.1016/j.spl.2008.01.020.
Search in Google Scholar Back to article
Marella, D., and D. Pfeffermann. 2019 “Matching Information from two independent informative samples.” Journal of Statistical Planning and Inference 203: 70–81. https://doi.org/10.1016/j.jspi.2019.03.001.
Search in Google Scholar Back to article
Marella, D., and D. Pfeffermann. 2021 “Accounting for nonignorable sampling and nonresponse in statistical matching.” International Statistical Review. Accepted for pubblication. DOI: https://doi.org/10.1111/insr.12524.
Search in Google Scholar Back to article
Meng, X-L., 2018. “Statistical paradises and paradoxes in big data (I): law of large populations, big data paradox and the 2016 US presidential election.” The Annals of Applied Statistics 12(2): 685–726. DOI: https://doi.org/10.1214/18-AOAS1161SF.
Search in Google Scholar Back to article
Owen, A.B. 2001. Empirical Likelihood. Chapman & Hall/CRC: New York.
Search in Google Scholar Back to article
Owen, A.B. 2013. “Self-concordance for empirical likelihood.” Canadian Journal ofStatistics 41(3): 387–397. DOI: https://doi.org/10.1002/cjs.11183.
Search in Google Scholar Back to article
Pfeffermann, D., A.M. Krieger, and Y. Rinott. 1998. “Parametric distribution of complex survey data under informative probability sampling.” Statistica Sinica 8(4): 1087–1114.
Search in Google Scholar Back to article
Pfeffermann, D., and M. Sverchkov. 2009. “Inference under Informative Sampling.” In Handbook of Statistics 29B: Sample Surveys: Inference and Analysis, edited by D. Pfeffermann and C.R. Rao.: 455–487. North Holland.
Search in Google Scholar Back to article
Pfeffermann, D. 2011. “Modelling of complex survey data: Why model? Why is it a problem? How can we approach it?” Survey Methodology 37(2): 115–136. Available at: https://www150.statcan.gc.ca/n1/en/pub/12-001-x/2011002/article/11602-eng.pdf?st=_vXrCcPb.
Search in Google Scholar Back to article
Pfeffermann, D., and V. Landsman. 2011. “Are private schools really better than public schools? Assessment by methods for observational studies.” Annals of Applied Statistics 5(3): 1726–1751. DOI: https://doi.org/10.1214/11-AOAS456.
Search in Google Scholar Back to article
Pfeffermann, D., and A. Sikov. 2011. “Imputation and Estimation under Nonignorable Non-response in Household Surveys with Missing Covariate Information.” Journal of Official Statistics 27(2): 181–209. Available at: https://www.scb.se/contentassets/ff271eeeca694f47ae99b942de61df83/imputation-and-estimation-under-nonignorable-nonresponse-in-household-surveys-with-missing-covariate-information.pdf.
Search in Google Scholar Back to article
Pfeffermann, D. 2015. “Methodological issues and challenges in the production of official statistics: 24th Annual Morris Hansen Lecture.” Journal of Survey Statistics and Methodology 3(4): 425–483. DOI: https://doi.org/10.1093/jssam/smv035.
Search in Google Scholar Back to article
R Core Team 2021. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at: http://www.R-project.org/.
Search in Google Scholar Back to article
Riddles, M.K., J. K. Kim, and J. Im. 2016. “A propensity-score-adjustment method for non-ignorable nonresponse.” Journal of Survey Statistics and Methodology” 4(2): 215–245. DOI: https://doi.org/10.1093/jssam/smv047.
Search in Google Scholar Back to article
Rivers, D. 2007. “Sampling for web surveys.” In Proceedings of the Section on Survey Research Methods, Joint Statistical Meetings. Salt Lake City, Utah, July 29-August 2, 2007: 4127–4134. Alexandria, VA: American Statistical Association. Available at: http://www.websm.org/uploadi/editor/1368187629Rivers2007Samplingforwebsurveys.pdf (accessed June 2022).
Search in Google Scholar Back to article
Rosenbaum, P.R., and D.B. Rubin. 1983. “The central role of the propensity score in observational studies for causal effects.” Biometrika 70(1): 41–55. DOI: https://doi.org/10.1093/biomet/70.1.41.
Search in Google Scholar Back to article
Rubin, D.B. 1976. “Inference and missing data.” Biometrika 63(3): 581–592. DOI: https://doi.org/10.1093/biomet/63.3.581.
Search in Google Scholar Back to article
Sheather, S.J., and M.C. Jones. 1991. “A reliable data-based bandwidth selection method for Kernel density estimation.” Journal of the Royal Statistical Society. Series B-Statistical Methodology 53(3): 683–690. DOI: https://doi.org/10.2307/2345597.
Search in Google Scholar Back to article
Variyath, A. M., J. Chen, and B. Abraham. 2010. “Empirical likelihood based variable selection.” Journal of Statistical Planning and Inference 140(4): 971–981. DOI: https://doi.org/10.1016/j.jspi.2009.09.025.
Search in Google Scholar Back to article
Yang, S., J.K. Kim, and Y. Hwang. 2021a. “Integration of data from probability surveys and big found data for finite population inference using mass imputation.” Survey Methodology 47(1): 29–58. Available at: https://www150.statcan.gc.ca/n1/en/pub/12-001-x/2021001/article/00004-eng.pdf?st=-WLDQdr7.
Search in Google Scholar Back to article
Yang, S., J.K. Kim, and R. Song. 2021b. “Doubly Robust Inference when Combining Probability and Nonprobability Samples with High-dimensional Data.” Journal of the Royal Statistical Society. Series B-Statistical Methodology 82(2): 445–465. DOI: https://doi.org/10.1111/rssb.12354.
Search in Google Scholar Back to article

Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach

References

Paradigm

My account