TY - JOUR
T1 - Data-Driven Two-Stage Framework for Identification and Characterization of Different Antibiotic-Resistant Escherichia coli Isolates Based on Mass Spectrometry Data
AU - Chung, Chia Ru
AU - Wang, Hsin Yao
AU - Yao, Chun Han
AU - Wu, Li Ching
AU - Lu, Jang Jih
AU - Horng, Jorng Tzong
AU - Lee, Tzong Yi
N1 - Publisher Copyright:
Copyright © 2023 Chung et al.
PY - 2023/5
Y1 - 2023/5
N2 - In clinical microbiology, matrix-assisted laser desorption ionization–time-of-flight mass spectrometry (MALDI-TOF MS) is frequently employed for rapid microbial identification. However, rapid identification of antimicrobial resistance (AMR) in Escherichia coli based on a large amount of MALDI-TOF MS data has not yet been reported. This may be because building a prediction model to cover all E. coli isolates would be challenging given the high diversity of the E. coli population. This study aimed to develop a MALDI-TOF MS-based, data-driven, two-stage framework for characterizing different AMRs in E. coli. Specifically, amoxicillin (AMC), ceftazidime (CAZ), ciprofloxacin (CIP), ceftriaxone (CRO), and cefuroxime (CXM) were used. In the first stage, we split the data into two groups based on informative peaks according to the importance of the random forest. In the second stage, prediction models were constructed using four different machine learning algorithms2logistic regression, support vector machine, random forest, and extreme gradient boosting (XGBoost). The findings demonstrate that XGBoost outperformed the other four machine learning models. The values of the area under the receiver operating characteristic curve were 0.62, 0.72, 0.87, 0.72, and 0.72 for AMC, CAZ, CIP, CRO, and CXM, respectively. This implies that a data-driven, two-stage framework could improve accuracy by approximately 2.8%. As a result, we developed AMR prediction models for E. coli using a data-driven two-stage framework, which is promising for assisting physicians in making decisions. Further, the analysis of informative peaks in future studies could potentially reveal new insights.
AB - In clinical microbiology, matrix-assisted laser desorption ionization–time-of-flight mass spectrometry (MALDI-TOF MS) is frequently employed for rapid microbial identification. However, rapid identification of antimicrobial resistance (AMR) in Escherichia coli based on a large amount of MALDI-TOF MS data has not yet been reported. This may be because building a prediction model to cover all E. coli isolates would be challenging given the high diversity of the E. coli population. This study aimed to develop a MALDI-TOF MS-based, data-driven, two-stage framework for characterizing different AMRs in E. coli. Specifically, amoxicillin (AMC), ceftazidime (CAZ), ciprofloxacin (CIP), ceftriaxone (CRO), and cefuroxime (CXM) were used. In the first stage, we split the data into two groups based on informative peaks according to the importance of the random forest. In the second stage, prediction models were constructed using four different machine learning algorithms2logistic regression, support vector machine, random forest, and extreme gradient boosting (XGBoost). The findings demonstrate that XGBoost outperformed the other four machine learning models. The values of the area under the receiver operating characteristic curve were 0.62, 0.72, 0.87, 0.72, and 0.72 for AMC, CAZ, CIP, CRO, and CXM, respectively. This implies that a data-driven, two-stage framework could improve accuracy by approximately 2.8%. As a result, we developed AMR prediction models for E. coli using a data-driven two-stage framework, which is promising for assisting physicians in making decisions. Further, the analysis of informative peaks in future studies could potentially reveal new insights.
KW - antimicrobial resistance
KW - cephalosporin
KW - cephalosporins
KW - fluoroquinolones
KW - machine learning
KW - MALDI-TOF MS
KW - matrix-assisted laser desorption ionization–time of flight mass spectrometry
KW - penicillin
KW - penicillins
UR - http://www.scopus.com/inward/record.url?scp=85163914180&partnerID=8YFLogxK
U2 - 10.1128/spectrum.03479-22
DO - 10.1128/spectrum.03479-22
M3 - Article
C2 - 37042778
AN - SCOPUS:85163914180
SN - 2165-0497
VL - 11
JO - Microbiology spectrum
JF - Microbiology spectrum
IS - 3
ER -