TY - JOUR
T1 - Towards Accurate Identification of Antibiotic-Resistant Pathogens through the Ensemble of Multiple Preprocessing Methods Based on MALDI-TOF Spectra
AU - Chung, Chia Ru
AU - Wang, Hsin Yao
AU - Chou, Po Han
AU - Wu, Li Ching
AU - Lu, Jang Jih
AU - Horng, Jorng Tzong
AU - Lee, Tzong Yi
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/1
Y1 - 2023/1
N2 - Matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) has been used to identify microorganisms and predict antibiotic resistance. The preprocessing method for the MS spectrum is key to extracting critical information from complicated MS spectral data. Different preprocessing methods yield different data, and the optimal approach is unclear. In this study, we adopted an ensemble of multiple preprocessing methods––FlexAnalysis, MALDIquant, and continuous wavelet transform-based methods––to detect peaks and build machine learning classifiers, including logistic regressions, naïve Bayes classifiers, random forests, and a support vector machine. The aim was to identify antibiotic resistance in Acinetobacter baumannii, Acinetobacter nosocomialis, Enterococcus faecium, and Group B Streptococci (GBS) based on MALDI-TOF MS spectra collected from two branches of a referral tertiary medical center. The ensemble method was compared with the individual methods. Random forest models built with the data preprocessed by the ensemble method outperformed individual preprocessing methods and achieved the highest accuracy, with values of 84.37% (A. baumannii), 90.96% (A. nosocomialis), 78.54% (E. faecium), and 70.12% (GBS) on independent testing datasets. Through feature selection, important peaks related to antibiotic resistance could be detected from integrated information. The prediction model can provide an opinion for clinicians. The discriminative peaks enabling better prediction performance can provide a reference for further investigation of the resistance mechanism.
AB - Matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) mass spectrometry (MS) has been used to identify microorganisms and predict antibiotic resistance. The preprocessing method for the MS spectrum is key to extracting critical information from complicated MS spectral data. Different preprocessing methods yield different data, and the optimal approach is unclear. In this study, we adopted an ensemble of multiple preprocessing methods––FlexAnalysis, MALDIquant, and continuous wavelet transform-based methods––to detect peaks and build machine learning classifiers, including logistic regressions, naïve Bayes classifiers, random forests, and a support vector machine. The aim was to identify antibiotic resistance in Acinetobacter baumannii, Acinetobacter nosocomialis, Enterococcus faecium, and Group B Streptococci (GBS) based on MALDI-TOF MS spectra collected from two branches of a referral tertiary medical center. The ensemble method was compared with the individual methods. Random forest models built with the data preprocessed by the ensemble method outperformed individual preprocessing methods and achieved the highest accuracy, with values of 84.37% (A. baumannii), 90.96% (A. nosocomialis), 78.54% (E. faecium), and 70.12% (GBS) on independent testing datasets. Through feature selection, important peaks related to antibiotic resistance could be detected from integrated information. The prediction model can provide an opinion for clinicians. The discriminative peaks enabling better prediction performance can provide a reference for further investigation of the resistance mechanism.
KW - antibiotic resistance
KW - machine learning
KW - MALDI-TOF MS
UR - http://www.scopus.com/inward/record.url?scp=85146712812&partnerID=8YFLogxK
U2 - 10.3390/ijms24020998
DO - 10.3390/ijms24020998
M3 - Article
AN - SCOPUS:85146712812
SN - 1661-6596
VL - 24
JO - International Journal Of Molecular Sciences
JF - International Journal Of Molecular Sciences
IS - 2
M1 - 998
ER -