en
Scientific article
Open access
English

Differentiation of COVID‐19 pneumonia from other lung diseases using CT radiomic features and machine learning : A large multicentric cohort study

Published inInternational journal of imaging systems and technology, vol. 34, no. 2, e23028
Publication date2024-02
First online date2024-02
Abstract

To derive and validate an effective machine learning and radiomics‐based model to differentiate COVID‐19 pneumonia from other lung diseases using a large multi‐centric dataset. In this retrospective study, we collected 19 private and five public datasets of chest CT images, accumulating to 26 307 images (15 148 COVID‐19; 9657 other lung diseases including non‐COVID‐19 pneumonia, lung cancer, pulmonary embolism; 1502 normal cases). We tested 96 machine learning‐based models by cross‐combining four feature selectors (FSs) and eight dimensionality reduction techniques with eight classifiers. We trained and evaluated our models using three different strategies: #1, the whole dataset (15 148 COVID‐19 and 11 159 other); #2, a new dataset after excluding healthy individuals and COVID‐19 patients who did not have RT‐PCR results (12 419 COVID‐19 and 8278 other); and #3 only non‐COVID‐19 pneumonia patients and a random sample of COVID‐19 patients (3000 COVID‐19 and 2582 others) to provide balanced classes. The best models were chosen by one‐standard‐deviation rule in 10‐fold cross‐validation and evaluated on the hold out test sets for reporting. In strategy#1, Relief FS combined with random forest (RF) classifier resulted in the highest performance (accuracy = 0.96, AUC = 0.99, sensitivity = 0.98, specificity = 0.94, PPV = 0.96, and NPV = 0.96). In strategy#2, Recursive Feature Elimination (RFE) FS and RF classifier combination resulted in the highest performance (accuracy = 0.97, AUC = 0.99, sensitivity = 0.98, specificity = 0.95, PPV = 0.96, NPV = 0.98). Finally, in strategy #3, the ANOVA FS and RF classifier combination resulted in the highest performance (accuracy = 0.94, AUC =0.98, sensitivity = 0.96, specificity = 0.93, PPV = 0.93, NPV = 0.96). Lung radiomic features combined with machine learning algorithms can enable the effective diagnosis of COVID‐19 pneumonia in CT images without the use of additional tests.

eng
Citation (ISO format)
SHIRI LORD, Isaac et al. Differentiation of COVID‐19 pneumonia from other lung diseases using CT radiomic features and machine learning : A large multicentric cohort study. In: International journal of imaging systems and technology, 2024, vol. 34, n° 2, p. e23028. doi: 10.1002/ima.23028
Main files (1)
Article (Published version)
Secondary files (1)
Identifiers
ISSN of the journal0899-9457
17views
3downloads

Technical informations

Creation02/01/2024 4:33:41 PM
First validation05/27/2024 2:44:31 PM
Update time05/27/2024 2:44:31 PM
Status update05/27/2024 2:44:31 PM
Last indexation05/27/2024 2:44:50 PM
All rights reserved by Archive ouverte UNIGE and the University of GenevaunigeBlack