Expectation-Maximization Estimation for Key-Value Data Randomized with Local Differential Privacy

Hikaru Horigome, Hiroaki Kikuchi*, Chia Mu Yu

*此作品的通信作者

研究成果: Conference contribution同行評審

摘要

This paper studies the local differential privacy (LDP) algorithm for key-value data that are pervasive in big data analysis. One of the state-of-the-arts algorithms, PrivKV, randomizes key-value pairs with a sequence of LDP algorithms. However, most likelihood estimation fails to estimate the statistics accurately when the frequency of the data for particular rare keys is limited. To address the problem, we propose the expectation-maximization-based algorithm designed for PrivKV. Instead of estimating continuous values [ - 1, 1 ] in key-value pairs, we focus on estimating the intermediate variable that contains the encoded binary bit ∈ { 1, - 1 }. This makes the problem tractable to estimate because we have a small set of possible input values and a set of observed outputs. We conduct some experiments using some synthetic data with some known distributions, e.g., Gaussian and power-law and well-known open datasets, MoveLens and Clothing. Our experiment using synthetic data and open datasets shows the robustness of estimation with regards to the size of data and the privacy budgets. The improvement is significant and the MSE of the proposed algorithm is 602.83 × 10 - 4 (41% of PrivKVM).

原文English
主出版物標題Advanced Information Networking and Applications - Proceedings of the 37th International Conference on Advanced Information Networking and Applications AINA-2023
編輯Leonard Barolli
發行者Springer Science and Business Media Deutschland GmbH
頁面501-512
頁數12
ISBN(列印)9783031284502
DOIs
出版狀態Published - 2023
事件37th International Conference on Advanced Information Networking and Applications, AINA 2023 - Juiz de Fora, Brazil
持續時間: 29 3月 202331 3月 2023

出版系列

名字Lecture Notes in Networks and Systems
654 LNNS
ISSN(列印)2367-3370
ISSN(電子)2367-3389

Conference

Conference37th International Conference on Advanced Information Networking and Applications, AINA 2023
國家/地區Brazil
城市Juiz de Fora
期間29/03/2331/03/23

指紋

深入研究「Expectation-Maximization Estimation for Key-Value Data Randomized with Local Differential Privacy」主題。共同形成了獨特的指紋。

引用此