TY - JOUR
T1 - A novel smart photoelectric lock system
T2 - Speech transmitted by laser and speech to text
AU - Guo, Cheng Yan
AU - Hsieh, Tung Li
AU - Chang, Chia Chi
AU - Perng, Jau Woei
N1 - Publisher Copyright:
© 2023
PY - 2023/3
Y1 - 2023/3
N2 - We propose a circuit that modulates a speech signal to a laser, using which the speech signal can be transmitted using the laser. Also, it shows the use of a platform based on embedded ARM (Advanced RISC Machine), running a small deep learning model based on TDNN (Time delay neural network) and LSTM (Long short-term memory), and converting speech to text, and use the text cipher for unlocking. This research implements a smart lock system that can set a pre-record speech cipher and verify the similarity through a laser transmission speech cipher to unlock it. In our experiment result, the English speech of laser transmission can reach a WER (Word error rate) of 14.06% through the deep learning model to recognize the content of the speech cipher. We also design a similarity comparison algorithm based on LCS (Longest common subsequence) to compare the character set of the laser transmission speech compare and the prerecord speech cipher to calculate the similarity rate. Through the similarity comparison algorithm, when the WER is 27.27%, the male speech samples used in this study still have a 95% unlocking success rate, while the female speech samples have a 100% unlocking success rate. Compared with only using automatic speech recognition (ASR) to unlock, the method we propose is to compare the similarity of the content of speech cipher. The method significantly improves the unlocking fault tolerance of using lasers to transmit audio. Therefore, by using the laser to transmit the speech cipher, the usability of the photoelectric smart lock system has been significantly improved. At the same time, the characteristics of the laser are not easy to eavesdrop on the cipher, which can also improve security.
AB - We propose a circuit that modulates a speech signal to a laser, using which the speech signal can be transmitted using the laser. Also, it shows the use of a platform based on embedded ARM (Advanced RISC Machine), running a small deep learning model based on TDNN (Time delay neural network) and LSTM (Long short-term memory), and converting speech to text, and use the text cipher for unlocking. This research implements a smart lock system that can set a pre-record speech cipher and verify the similarity through a laser transmission speech cipher to unlock it. In our experiment result, the English speech of laser transmission can reach a WER (Word error rate) of 14.06% through the deep learning model to recognize the content of the speech cipher. We also design a similarity comparison algorithm based on LCS (Longest common subsequence) to compare the character set of the laser transmission speech compare and the prerecord speech cipher to calculate the similarity rate. Through the similarity comparison algorithm, when the WER is 27.27%, the male speech samples used in this study still have a 95% unlocking success rate, while the female speech samples have a 100% unlocking success rate. Compared with only using automatic speech recognition (ASR) to unlock, the method we propose is to compare the similarity of the content of speech cipher. The method significantly improves the unlocking fault tolerance of using lasers to transmit audio. Therefore, by using the laser to transmit the speech cipher, the usability of the photoelectric smart lock system has been significantly improved. At the same time, the characteristics of the laser are not easy to eavesdrop on the cipher, which can also improve security.
KW - LCS
KW - LSTM
KW - Photoelectric smart lock
KW - Speech recognition
KW - TDNN
UR - http://www.scopus.com/inward/record.url?scp=85150372424&partnerID=8YFLogxK
U2 - 10.1016/j.heliyon.2023.e14510
DO - 10.1016/j.heliyon.2023.e14510
M3 - Article
AN - SCOPUS:85150372424
SN - 2405-8440
VL - 9
JO - Heliyon
JF - Heliyon
IS - 3
M1 - e14510
ER -