TY - JOUR
T1 - On the Private Data Synthesis Through Deep Generative Models for Data Scarcity of Industrial Internet of Things
AU - Chen, Yen Ting
AU - Hsu, Chia Yi
AU - Yu, Chia Mu
AU - Barhamgi, Mahmoud
AU - Perera, Charith
N1 - Publisher Copyright:
© 2005-2012 IEEE.
PY - 2023/1/1
Y1 - 2023/1/1
N2 - Due to the data-driven intelligence from the recent deep learning based approaches, the huge amount of data collected from various kinds of sensors from industrial devices have the potential to revolutionize the current technologies used in the industry. To improve the efficiency and quality of machines, the machine manufacturer needs to acquire the history of the machine operation process. However, due to the business secrecy, the factories are not willing to do so. One promising solution to the abovementioned difficulty is the synthetic dataset and an informatic network structure, both through deep generative models such as differentially private generative adversarial networks. Hence, this article initiates the study of the utility difference between the abovementioned two kinds. We carry out an empirical study and find that the classifier generated by private informatic network structure is more accurate than the classifier generated by private synthetic data, with approximately 0.31-7.66%.
AB - Due to the data-driven intelligence from the recent deep learning based approaches, the huge amount of data collected from various kinds of sensors from industrial devices have the potential to revolutionize the current technologies used in the industry. To improve the efficiency and quality of machines, the machine manufacturer needs to acquire the history of the machine operation process. However, due to the business secrecy, the factories are not willing to do so. One promising solution to the abovementioned difficulty is the synthetic dataset and an informatic network structure, both through deep generative models such as differentially private generative adversarial networks. Hence, this article initiates the study of the utility difference between the abovementioned two kinds. We carry out an empirical study and find that the classifier generated by private informatic network structure is more accurate than the classifier generated by private synthetic data, with approximately 0.31-7.66%.
KW - Data synthesis
KW - deep generative model (DGM)
KW - differential privacy (DP)
KW - generative adversarial network (GAN)
KW - industrial Internet of Things (IIoT)
UR - http://www.scopus.com/inward/record.url?scp=85142522586&partnerID=8YFLogxK
U2 - 10.1109/TII.2021.3133625
DO - 10.1109/TII.2021.3133625
M3 - Article
AN - SCOPUS:85142522586
SN - 1551-3203
VL - 19
SP - 551
EP - 560
JO - IEEE Transactions on Industrial Informatics
JF - IEEE Transactions on Industrial Informatics
IS - 1
ER -