Have a personal or library account? Click to login
Impact of sample size on principal component analysis ordination of an environmental data set: effects on eigenstructure Cover

Impact of sample size on principal component analysis ordination of an environmental data set: effects on eigenstructure

Open Access
|May 2016

Abstract

In this study, we used bootstrap simulation of a real data set to investigate the impact of sample size (N = 20, 30, 40 and 50) on the eigenvalues and eigenvectors resulting from principal component analysis (PCA). For each sample size, 100 bootstrap samples were drawn from environmental data matrix pertaining to water quality variables (p = 22) of a small data set comprising of 55 samples (stations from where water samples were collected). Because in ecology and environmental sciences the data sets are invariably small owing to high cost of collection and analysis of samples, we restricted our study to relatively small sample sizes. We focused attention on comparison of first 6 eigenvectors and first 10 eigenvalues. Data sets were compared using agglomerative cluster analysis using Ward’s method that does not require any stringent distributional assumptions.

DOI: https://doi.org/10.1515/eko-2016-0014 | Journal eISSN: 1337-947X | Journal ISSN: 1335-342X
Language: English
Page range: 173 - 190
Published on: May 28, 2016
In partnership with: Paradigm Publishing Services
Publication frequency: 2 issues per year

© 2016 S. Shahid Shaukat, Toqeer Ahmed Rao, Moazzam A. Khan, published by Slovak Academy of Sciences, Institute of Landscape Ecology
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.