Have a personal or library account? Click to login
Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach Cover

Adjusting for Selection Bias in Nonprobability Samples by Empirical Likelihood Approach

By: Daniela Marella  
Open Access
|Jun 2023

References

  1. Agresti, A. 2007. An Introduction to Categorical Data Analysis (second edition). John Wiley & Sons, Inc., Hoboken: New Jersey.
  2. Babu, G.J., and C.R. Rao. 2004. “Goodness-of-Fit Tests when Parameters are Estimated.” Sankhyā. Series A 66(1): 63–74. DOI: https://doi.org/10.2307/25053332.
  3. Banca d’Italia. 2012. Supplement to the Statistical Bulletin, Sample Surveys, Household income and wealth in 2010: 12(6). Available at: https://www.bancaditalia.it/pubblicazioni/indagine-famiglie/bil-fam2010.
  4. Beaumont, J.F. 2000. “An Estimation Method for Nonignorable Nonresponse.” Survey Methodology 26(2): 131–136. Available at: https://www150.statcan.gc.ca/n1/en/pub/12-001-x/2000002/article/5532-eng.pdf?st=WJWdN_3l.
  5. Belzile, L., J.L. Wadsworth, P.J. Northrop, S.D. Grimshaw, J. Zhang, M.A. Stephens, A.B. Owen, and R. Huser. 2022. mev: Modelling Extreme Values. R package version 1.14 Available at: https://CRAN.R-project.org/package=mev (accessed June 2022).
  6. Beresewicz, M., R. Lehtonen, F. Reis, L. Di Consiglio and M. Karlberg. 2018. An overview of methods for treating selectivity in big data sources. Statistical Working Papers, Eurostat. Available at: https://ec.europa.eu/eurostat/web/products-statistical-working-papers/-/ks-tc-18-004 (accessed July 2022).
  7. Chang, T., and P.S. Kott. 2008. “Using Calibration Weighting to Adjust for Nonresponse under a Plausible Model.” Biometrika 95(3): 555–571. DOI: https://doi.org/10.1093/-biomet/asn022.
  8. Chaudhuri S., M.S. Handcock, and M.S. Rendall. 2010. A conditional empirical likelihood approach to combine sampling design and population level information. Technical report No. 3/2010, National University of Singapore, Singapore. Available at: https://cpb-us-w2.wpmucdn.com/blog.nus.edu.sg/dist/0/14452/files/2020/10/tr032010.pdf (accessed July 2022).
  9. Chen, C., M. Wang, R. Wu, and R. Li. 2022. “A Robust Consistent Information Criterion for Model Selection Based on Empirical Likelihood.” Statistica Sinica 32: 1205–1223. DOI: https://doi.org/10.5705/ss.202020.0254.
  10. Conti P.L., D. Marella, and M. Scanu. 2008. “Evaluation of matching noise for imputation techniques based on nonparametric local linear regression estimators.” Computational Statistics & Data Analysis 53(2): 354–365. DOI: https://doi.org/10.1016/j.csda.2008.07.041.
  11. DiSogra, C., C. Cobb, E. Chan, and J. M. Dennis. 2011. “Calibrating non-probability internet samples with probability samples using early adopter characteristics.” In Proceedings of the Section on Survey Research Methods, Joint Statistical Meetings. Miami Beach, Florida, July 30-August 4, 2011: 4501–4515. Alexandria, VA: American Statistical Association. Available at: http://www.asasrms.org/Proceedings/y2011/Files/30270468925.pdf (accessed June 2022).
  12. Elliott, M., and R. Valliant. 2017. “Inference for non-probability samples.” Statistical Science 32(2): 249–264. DOI: https://doi.org/10.1214/16-STS598.
  13. Feder, M., and D. Pfeffermann. 2019. Statistical Inference Under Non-ignorable Sampling and Non-response. An Empirical Likelihood Approach. Working paper. University of Southampton. Available at: https://eprints.soton.ac.uk/378245/ (accessed July 2022).
  14. Galimard J.E., S. Chevret, E. Curis, and M. Resche-Rigon. 2018. “Heckman imputation models for binary or continuous MNAR outcomes and MAR predictors.” BMC Medical Research Methodology 18(90). DOI: https://doi.org/10.1186/s12874-018-0547-1.
  15. Hájek, J. 1964. “Asymptotic theory of rejective sampling with varying probabilities from a finite population.” The Annals of Mathematical Statistics 35(4): 1491–1523. DOI: 10.1214/aoms/1177700375.
  16. Heckman, J.J. 1979. “Sample Selection Bias as a Specification Error.” Econometrica 47(1): 153–161. DOI: http://dx.doi.org/10.2307/1912352.
  17. Kim, J.K., and Z. Wang. 2019. “Sampling techniques for big data analysis in finite population inference.” International Statistical Review 87(S1): S177–S191. DOI: https://doi.org/10.1111/insr.12290.
  18. Kott, P.S. 2006. “Using calibration weighting to adjust for nonresponse and coverage errors.” Survey Methodology 32(2): 133–142. Available at: https://www150.statcan.gc.ca/n1/en/pub/12-001-x/2006002/article/9547-eng.pdf?st=B2aZNvo0.
  19. Kott, P.S., and T. Chang. 2010. “Using Calibration Weighting to Adjust for Nonignorable Unit Nonresponse.” Journal of the American Statistical Association 105(491): 1265–1275. DOI: https://doi.org/10.1198/jasa.2010.tm09016.
  20. Lee, J., and J.O. Berger. 2001. “Semiparametric Bayesian analysis of selection models.” Journal of the American Statistical Association 96(456): 1397–1409. DOI: https://doi.org/10.1198/016214501753382318.
  21. Marella D., M. Scanu, and P.L. Conti. 2008. “On the matching noise of some nonparametric imputation procedures.” Statistics & Probability Letters 78(12): 1593–1600. DOI: https://doi.org/10.1016/j.spl.2008.01.020.
  22. Marella, D., and D. Pfeffermann. 2019 “Matching Information from two independent informative samples.” Journal of Statistical Planning and Inference 203: 70–81. https://doi.org/10.1016/j.jspi.2019.03.001.
  23. Marella, D., and D. Pfeffermann. 2021 “Accounting for nonignorable sampling and nonresponse in statistical matching.” International Statistical Review. Accepted for pubblication. DOI: https://doi.org/10.1111/insr.12524.
  24. Meng, X-L., 2018. “Statistical paradises and paradoxes in big data (I): law of large populations, big data paradox and the 2016 US presidential election.” The Annals of Applied Statistics 12(2): 685–726. DOI: https://doi.org/10.1214/18-AOAS1161SF.
  25. Owen, A.B. 2001. Empirical Likelihood. Chapman & Hall/CRC: New York.
  26. Owen, A.B. 2013. “Self-concordance for empirical likelihood.” Canadian Journal ofStatistics 41(3): 387–397. DOI: https://doi.org/10.1002/cjs.11183.
  27. Pfeffermann, D., A.M. Krieger, and Y. Rinott. 1998. “Parametric distribution of complex survey data under informative probability sampling.” Statistica Sinica 8(4): 1087–1114.
  28. Pfeffermann, D., and M. Sverchkov. 2009. “Inference under Informative Sampling.” In Handbook of Statistics 29B: Sample Surveys: Inference and Analysis, edited by D. Pfeffermann and C.R. Rao.: 455–487. North Holland.
  29. Pfeffermann, D. 2011. “Modelling of complex survey data: Why model? Why is it a problem? How can we approach it?” Survey Methodology 37(2): 115–136. Available at: https://www150.statcan.gc.ca/n1/en/pub/12-001-x/2011002/article/11602-eng.pdf?st=_vXrCcPb.
  30. Pfeffermann, D., and V. Landsman. 2011. “Are private schools really better than public schools? Assessment by methods for observational studies.” Annals of Applied Statistics 5(3): 1726–1751. DOI: https://doi.org/10.1214/11-AOAS456.
  31. Pfeffermann, D., and A. Sikov. 2011. “Imputation and Estimation under Nonignorable Non-response in Household Surveys with Missing Covariate Information.” Journal of Official Statistics 27(2): 181–209. Available at: https://www.scb.se/contentassets/ff271eeeca694f47ae99b942de61df83/imputation-and-estimation-under-nonignorable-nonresponse-in-household-surveys-with-missing-covariate-information.pdf.
  32. Pfeffermann, D. 2015. “Methodological issues and challenges in the production of official statistics: 24th Annual Morris Hansen Lecture.” Journal of Survey Statistics and Methodology 3(4): 425–483. DOI: https://doi.org/10.1093/jssam/smv035.
  33. R Core Team 2021. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Available at: http://www.R-project.org/.
  34. Riddles, M.K., J. K. Kim, and J. Im. 2016. “A propensity-score-adjustment method for non-ignorable nonresponse.” Journal of Survey Statistics and Methodology” 4(2): 215–245. DOI: https://doi.org/10.1093/jssam/smv047.
  35. Rivers, D. 2007. “Sampling for web surveys.” In Proceedings of the Section on Survey Research Methods, Joint Statistical Meetings. Salt Lake City, Utah, July 29-August 2, 2007: 4127–4134. Alexandria, VA: American Statistical Association. Available at: http://www.websm.org/uploadi/editor/1368187629Rivers2007Samplingforwebsurveys.pdf (accessed June 2022).
  36. Rosenbaum, P.R., and D.B. Rubin. 1983. “The central role of the propensity score in observational studies for causal effects.” Biometrika 70(1): 41–55. DOI: https://doi.org/10.1093/biomet/70.1.41.
  37. Rubin, D.B. 1976. “Inference and missing data.” Biometrika 63(3): 581–592. DOI: https://doi.org/10.1093/biomet/63.3.581.
  38. Sheather, S.J., and M.C. Jones. 1991. “A reliable data-based bandwidth selection method for Kernel density estimation.” Journal of the Royal Statistical Society. Series B-Statistical Methodology 53(3): 683–690. DOI: https://doi.org/10.2307/2345597.
  39. Variyath, A. M., J. Chen, and B. Abraham. 2010. “Empirical likelihood based variable selection.” Journal of Statistical Planning and Inference 140(4): 971–981. DOI: https://doi.org/10.1016/j.jspi.2009.09.025.
  40. Yang, S., J.K. Kim, and Y. Hwang. 2021a. “Integration of data from probability surveys and big found data for finite population inference using mass imputation.” Survey Methodology 47(1): 29–58. Available at: https://www150.statcan.gc.ca/n1/en/pub/12-001-x/2021001/article/00004-eng.pdf?st=-WLDQdr7.
  41. Yang, S., J.K. Kim, and R. Song. 2021b. “Doubly Robust Inference when Combining Probability and Nonprobability Samples with High-dimensional Data.” Journal of the Royal Statistical Society. Series B-Statistical Methodology 82(2): 445–465. DOI: https://doi.org/10.1111/rssb.12354.
Language: English
Page range: 151 - 172
Submitted on: Jul 1, 2021
|
Accepted on: Jul 1, 2022
|
Published on: Jun 9, 2023
Published by: Sciendo
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2023 Daniela Marella, published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.