Clustimpute: k-means Clustering with Built-in Missing Data Imputation

By: Oliver Pfaffel  
Open Access | Aug 2025

References

  1. Van Buuren S. Flexible imputation of missing data. Chapman and Hall/CRC; 2018. DOI: 10.1201/9780429492259
  2. van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate imputation by chained equations in R. Journal of Statistical Software. 2011;45(3):1–67. URL: https://www.jstatsoft.org/v45/i03/.
  3. Mouselimis L. ClusterR: Gaussian mixture models, k-means, mini-batch-kmeans and k-medoids clustering; 2020. URL: https://CRAN.R-project.org/package=ClusterR. R package version 1.2.1.
  4. Eddelbuettel D, Sanderson C. RcppArmadillo: ‘Rcpp’ integration for the ‘Armadillo’ templated linear algebra library; 2020. URL: https://CRAN.R-project.org/package=RcppArmadillo. R package version 0.9.850.1.0.
  5. Wickham H, François R, Henry L, Müller K. dplyr: A grammar of data manipulation; 2020. URL: https://CRAN.R-project.org/package=dplyr. R package version 0.8.5.
  6. Henry L, Wickham H. rlang: Functions for base types and core R and ‘tidyverse’ features; 2020. URL: https://CRAN.R-project.org/package=rlang. R package version 0.4.5.
  7. Wickham H. testthat: Unit testing for R; 2020. URL: https://CRAN.R-project.org/package=testthat. R package version 2.3.2.
  8. van Buuren S, Groothuis-Oudshoorn K. mice: Multivariate imputation by chained equations in R; 2020. URL: https://CRAN.R-project.org/package=mice. R package version 3.8.0.
  9. Honaker J, King G, Blackwell M. Amelia II: A program for missing data. Journal of Statistical Software. 2011;45(7):1–47. URL: https://www.jstatsoft.org/v45/i07/.
  10. Honaker J, King G, Blackwell M. Amelia II: A program for missing data; 2020. URL: https://CRAN.R-project.org/package=Amelia. R package version 1.7.6.
  11. Stekhoven DJ, Bühlmann P. MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics. 2012;28(1):112–118. DOI: 10.1093/bioinformatics/btr597
  12. Berndt P. missRanger: Fast imputation of missing values; 2020. URL: https://CRAN.R-project.org/package=missRanger. R package version 2.1.3.
  13. Wright MN, Ziegler A. ranger: A fast implementation of random forests; 2020. URL: https://CRAN.R-project.org/package=ranger. R package version 0.12.1.
  14. Little RJA, Rubin DB. Statistical analysis with missing data, volume 793. John Wiley & Sons; 2019. DOI: 10.1002/9781119482260
  15. Rand WM. Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association. 1971;66(336):846–850. DOI: 10.1080/01621459.1971.10482356
  16. Fisher RA. The use of multiple measurements in taxonomic problems. Annals of Eugenics. 1936;7(2):179–188. DOI: 10.1111/j.1469-1809.1936.tb02137.x
  17. Sugino KY, Hernandez TL, Barbour LA, Kofonow JM, Frank DN, Friedman JE. Distinct plasma metabolomic and gut microbiome profiles after gestational diabetes mellitus diet treatment: Implications for personalized dietary interventions. Microorganisms. 2024;12(7):1369. DOI: 10.3390/microorganisms12071369
  18. Knight EC, Carlisle J, Boyce AJ, Bradley D, Cimprich P, Coates S, Dinsmore SJ, Gregory CJ, Jorgensen JG, Kelly JF, et al. Delineating ecologically distinct groups for annual cycle management of a declining shorebird. Journal of Applied Ecology. 2025;62(5):1152–1165. DOI: 10.1111/1365-2664.14885
  19. Nowinski B, Feng X, Preston CM, Birch JM, Luo H, Whitman WB, Moran MA. Ecological divergence of syntopic marine bacterial species is shaped by gene content and expression. The ISME Journal. 2023;17(6):813–822. DOI: 10.1038/s41396-023-01390-4
  20. Xu D, Hu PJ-H, Fang X. Deep learning-based imputation method to enhance crowdsourced data on online business directory platforms for improved services. Journal of Management Information Systems. 2023;40(2):624–654. DOI: 10.1080/07421222.2023.2196770
DOI: https://doi.org/10.5334/jors.345 | Journal eISSN: 2049-9647
Language: English
Submitted on: Aug 22, 2020
Accepted on: Aug 7, 2025
Published on: Aug 18, 2025
Published by: Ubiquity Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2025 Oliver Pfaffel, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.