DPView: Differentially Private Data Synthesis Through Domain Size Information

Chih Hsun Lin, Chia Mu Yu*, Chun Ying Huang

*此作品的通信作者

研究成果: Article同行評審

3 引文 斯高帕斯(Scopus)

摘要

The use of differentially private synthetic data has been adopted as a common security measure for the public release of sensitive data. However, the existing solutions either suffer from serious privacy budget splitting or fail to fully automate the generation procedures. In this study, we propose an automated system for synthesizing differentially private synthetic tabular data, called DPView. Our key insight is that high-dimensional data synthesis can be accomplished by utilizing the domain sizes of attributes, which are public information, whereas identifying the correlation among attributes is necessary but leads to severe privacy budget splitting. In addition, we analytically optimize both the privacy budget allocation and consistency procedures of the proposed method through mathematical programming. We further propose two novel methods, including iterative non-negativity and consistency-aware normalization, to postprocess the synthetic data. An extensive set of experimental results demonstrates the superior utility of DPView.

原文English
頁(從 - 到)15886-15900
頁數15
期刊IEEE Internet of Things Journal
9
發行號17
DOIs
出版狀態Published - 1 9月 2022

指紋

深入研究「DPView: Differentially Private Data Synthesis Through Domain Size Information」主題。共同形成了獨特的指紋。

引用此