New data suggest machine learning algorithms can accurately predict C. diff infection in hospitalized patients

C. difficile
This photograph depicts Clostridium difficile colonies after 48hrs growth on a blood agar plate; Magnified 4.8X. C. difficile, an anaerobic gram-positive rod, is the most frequently identified cause of antibiotic-associated diarrhea (AAD). It accounts for approximately 15–25% of all episodes of AAD. Credit: CDC

New data published today suggest that several commonly used machine learning algorithms (MLAs) can effectively predict which hospitalized patients will become infected with Clostridiodes difficile (C. diff). The findings, which appear in the American Journal of Infection Control (AJIC), the journal of the Association for Professionals in Infection Control and Epidemiology (APIC), could support infection prevention and early diagnosis, as well as more timely implementation of infection control measures to minimize C. diff spread.

"Our study findings suggest that MLAs could play a significant role in reducing the clinical and economic impact of healthcare-associated infections such as C. diff by providing early predictions of at- prior to them developing serious complications," said Jana Hoffman, VP of Science, Dascena, Inc. "These data are consistent with a growing body of evidence that validates artificial intelligence and MLAs as integral components of healthcare management that can improve patient outcomes and assist time-constrained clinicians in providing the best patient care."

C. diff infection (CDI) is the leading cause of hospital-acquired diarrhea and is associated with significant morbidity, mortality, and healthcare costs. There is currently no gold standard tool to assess individual patients' risk of acquiring CDI. Hoffman and her colleagues have previously published data which demonstrate that MLAs can predict patients at risk of developing other high-impact HAIs.

For the study published today, the researchers used a database comprising electronic health record (EHR) patient data from more than 700 hospitals nationwide to train and then systematically evaluate three different, classical machine-learning and deep-learning methods. They initially assessed of each of these methods to determine whether they could effectively predict CDI among hospitalized patients using early inpatient data, and then used a distinct, external dataset to evaluate the generalizability of the best-performing MLA models.

Results suggest that MLAs can predict CDI with excellent discrimination using just the first six hours of inpatient data. Among the three methods studied, a machine-learning method called XGBoost provided the highest overall accuracy in predicting CDI, despite being the least complex model. XGBoost also demonstrated generalizability by maintaining its predictive performance in an external dataset. The other two methods researchers evaluated, known as Deep Long Short Term Memory (D-LSTM) and one-dimensional convolutional neural network (1D-CNN), also demonstrated high levels of predictive accuracy, though were less generalizable.

The best-performing XGBoost, D-LSTM and 1D-CNN models used similar features to predict CDI among patients, all of which have previously been identified as risk factors. In this study, age was the leading CDI risk factor, followed by clinical measurements such as sodium, body mass index, white blood cell count, and ; active treatment with antibiotics or ; glycated hemoglobin; and race.

"This study supports earlier research suggesting that MLAs provide reliable -risk prediction that can empower clinical teams to implement appropriate infection control measures at earlier time points and thereby improve healthcare outcomes," said Linda Dickey, RN, MPH, CIC, FAPIC, and 2022 APIC president.

More information: Saarang Panchavati et al, A comparative analysis of machine learning approaches to predict C. difficile infection in hospitalized patients, American Journal of Infection Control (2022). DOI: 10.1016/j.ajic.2021.11.012

Provided by Association for Professionals in Infection Control
Citation: New data suggest machine learning algorithms can accurately predict C. diff infection in hospitalized patients (2022, January 20) retrieved 25 February 2024 from
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Researchers create AI algorithm to improve timeliness, accuracy of sepsis predictions


Feedback to editors