Expectation-Maximization Estimation for Key-Value Data Randomized with Local Differential Privacy

Hikaru Horigome, Hiroaki Kikuchi*, Chia Mu Yu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper studies the local differential privacy (LDP) algorithm for key-value data that are pervasive in big data analysis. One of the state-of-the-arts algorithms, PrivKV, randomizes key-value pairs with a sequence of LDP algorithms. However, most likelihood estimation fails to estimate the statistics accurately when the frequency of the data for particular rare keys is limited. To address the problem, we propose the expectation-maximization-based algorithm designed for PrivKV. Instead of estimating continuous values [ - 1, 1 ] in key-value pairs, we focus on estimating the intermediate variable that contains the encoded binary bit ∈ { 1, - 1 }. This makes the problem tractable to estimate because we have a small set of possible input values and a set of observed outputs. We conduct some experiments using some synthetic data with some known distributions, e.g., Gaussian and power-law and well-known open datasets, MoveLens and Clothing. Our experiment using synthetic data and open datasets shows the robustness of estimation with regards to the size of data and the privacy budgets. The improvement is significant and the MSE of the proposed algorithm is 602.83 × 10 - 4 (41% of PrivKVM).

Original languageEnglish
Title of host publicationAdvanced Information Networking and Applications - Proceedings of the 37th International Conference on Advanced Information Networking and Applications AINA-2023
EditorsLeonard Barolli
PublisherSpringer Science and Business Media Deutschland GmbH
Pages501-512
Number of pages12
ISBN (Print)9783031284502
DOIs
StatePublished - 2023
Event37th International Conference on Advanced Information Networking and Applications, AINA 2023 - Juiz de Fora, Brazil
Duration: 29 Mar 202331 Mar 2023

Publication series

NameLecture Notes in Networks and Systems
Volume654 LNNS
ISSN (Print)2367-3370
ISSN (Electronic)2367-3389

Conference

Conference37th International Conference on Advanced Information Networking and Applications, AINA 2023
Country/TerritoryBrazil
CityJuiz de Fora
Period29/03/2331/03/23

Fingerprint

Dive into the research topics of 'Expectation-Maximization Estimation for Key-Value Data Randomized with Local Differential Privacy'. Together they form a unique fingerprint.

Cite this