Have a personal or library account? Click to login
A Simple Method for Limiting Disclosure in Continuous Microdata Based on Principal Component Analysis Cover

A Simple Method for Limiting Disclosure in Continuous Microdata Based on Principal Component Analysis

By: Aida Calviño  
Open Access
|Feb 2017

Abstract

In this article we propose a simple and versatile method for limiting disclosure in continuous microdata based on Principal Component Analysis (PCA). Instead of perturbing the original variables, we propose to alter the principal components, as they contain the same information but are uncorrelated, which permits working on each component separately, reducing processing times. The number and weight of the perturbed components determine the level of protection and distortion of the masked data. The method provides preservation of the mean vector and the variance-covariance matrix. Furthermore, depending on the technique chosen to perturb the principal components, the proposed method can provide masked, hybrid or fully synthetic data sets. Some examples of application and comparison with other methods previously proposed in the literature (in terms of disclosure risk and data utility) are also included.

Language: English
Page range: 15 - 41
Submitted on: Sep 1, 2015
|
Accepted on: Aug 1, 2016
|
Published on: Feb 21, 2017
Published by: Sciendo
In partnership with: Paradigm Publishing Services
Publication frequency: 4 issues per year

© 2017 Aida Calviño, published by Sciendo
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.