Enhanced Diagnosis of Influenza and COVID-19 Using Machine Learning

Dang Nhu Phu; Phan Cong Vinh; Nguyen Kim Quoc

doi:10.4108/eetcasa.v9i1.4030

Authors

Dang Nhu Phu, Nguyen Tat Thanh University
Phan Cong Vinh, Nguyen Tat Thanh University
Nguyen Kim Quoc, Nguyen Tat Thanh University

Keywords:

COVID-19, influenza-like illness, machine learning, predictive modeling, XGBoost, random forest, diagnostic parameters, clinical symptoms, disease detection, pandemic response strategies

Abstract

Coronavirus Disease 2019 (COVID-19) has spread rapidly worldwide, with a significant impact on public health. This study proposes a predictive model that uses machine learning to distinguish influenza-like illness from COVID-19 based on clinical symptoms and diagnostic parameters. Using a dataset of influenza and COVID-19 cases sourced from BMC Medical Informatics and Decision Making, we explore a diverse set of features, including clinical symptoms and blood assay parameters. Two widely used machine learning algorithms, XGBoost and Random Forest, are trained and compared on their predictive performance. The XGBoost model performs better, reaching an area under the ROC curve (AUC) of 98.8%, which indicates its potential for clinical diagnosis, especially in settings with limited specialized testing equipment. The model's practical applicability in community-based testing makes it a useful tool for efficient COVID-19 detection. This study contributes to predictive modeling for disease detection and offers promising prospects for improved public health outcomes and pandemic response strategies. The model's reliability and effectiveness make it a valuable asset in the ongoing response to the COVID-19 pandemic.
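The comparison above rests on the area under the ROC curve. As a minimal sketch of how that metric can be computed for two competing classifiers, the Python snippet below uses the pairwise (Mann-Whitney) formulation of AUC. The function name `roc_auc`, the labels, and the scores are illustrative assumptions, not the study's code or data.

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """Area under the ROC curve via pairwise comparison (Mann-Whitney U).

    labels: 1 for the positive class (e.g. COVID-19), 0 for the negative
    class (e.g. influenza). scores: model outputs, higher = more positive.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one case from each class")

    # Count positive/negative pairs ranked correctly; ties count as half.
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Made-up labels and scores for two hypothetical models (not study data).
labels = [1, 1, 1, 0, 0, 0]
model_a_scores = [0.9, 0.8, 0.4, 0.5, 0.2, 0.1]
model_b_scores = [0.9, 0.3, 0.4, 0.5, 0.6, 0.1]

auc_a = roc_auc(labels, model_a_scores)
auc_b = roc_auc(labels, model_b_scores)
print(f"model A AUC = {auc_a:.3f}, model B AUC = {auc_b:.3f}")
```

An AUC of 1.0 means every positive case is scored above every negative case, while 0.5 corresponds to random ranking. In practice the scores would be predicted probabilities from each trained model on held-out cases.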

References

Davide Brinati, Andrea Campagner, Davide Ferrari, Massimo Locatelli, Giuseppe Banfi, Federico Cabitza (2020). Detection of COVID-19 Infection from Routine Blood Exams with Machine Learning: A Feasibility Study. Journal of Medical Systems. Springer. DOI: https://doi.org/10.1101/2020.04.22.20075143

Wei Tse Li, Jiayan Ma, Neil Shende, Grant Castaneda, Jaideep Chakladar, Joseph C. Tsai, Lauren Apostol, Christine O. Honda, Jingyue Xu, Lindsay M. Wong, Tianyi Zhang, Abby Lee, Aditi Gnanasekar, Thomas K. Honda, Selena Z. Kuo, Michael Andrew Yu, Eric Y. Chang, Mahadevan Rajasekaran and Weg M. Ongkeko (2020). Using machine learning of clinical data to diagnose COVID-19: a systematic review and meta-analysis. BMC Medical Informatics and Decision Making, BMC.

Pablo Sieber, Domenica Flury, Sabine Güsewell, Werner C. Albrich, Katia Boggian, Céline Gardiol, Matthias Schlegel, Robert Sieber, Pietro Vernazza and Philipp Kohler (2021). Characteristics of patients with Coronavirus Disease 2019 (COVID-19) and seasonal influenza at time of hospital admission: a single center comparative study. BMC Infectious Diseases, BMC. DOI: https://doi.org/10.1186/s12879-021-05957-4

Xueyan Mei et al. (2020). Artificial intelligence–enabled rapid diagnosis of patients with COVID-19. Nature Medicine. DOI: https://doi.org/10.1038/s41591-020-0931-3

BMC Infectious Diseases (2020). Dataset. DOI: https://doi.org/10.1186/s12879-020-05551-1

Tianqi Chen, Tong He, Michael Benesty, Vadim Khotilovich, Yuan Tang (2017). Extreme Gradient Boosting. CRAN.

Leo Breiman (2001). Random Forests. Statistics Department, University of California, Berkeley, CA 94720.

Brett Lantz (2015). Machine Learning with R. Packt, p. 331.