Classification Algorithms for Liver Epidemic Identification

Authors

  • Koteswara Rao Makkena Vellore Institute of Technology University image/svg+xml
  • Karthika Natarajan Vellore Institute of Technology University image/svg+xml

DOI:

https://doi.org/10.4108/eetpht.9.4379

Keywords:

medical care, liver epidemic, prognosis, classification models, Sythetic Minority Over Sampling Technique

Abstract

Situated in the upper right region of the abdomen, beneath the diaphragm and above the stomach, lies the liver. It is a crucial organ essential for the proper functioning of the body.  The principal tasks are to eliminate generated waste produced by our organs, and digestive food and preserve vitamins and energy materials. It performs many important functions in the body, it regulates the balance of hormones in the body filtering and removing bacteria, viruses, and other harmful substances from the blood. In certain dire circumstances, the outcome can unfortunately result in fatality. There exist numerous classifications of liver diseases, based on their causes or distinguishing characteristics. Some common categories of liver disease include Viral hepatitis, Autoimmune liver disease, Metabolic liver disease, Alcohol-related liver disease, Non-alcoholic fatty liver disease, Genetic liver disease, Drug-induced liver injury, Biliary tract disorders. Machine learning algorithms can help identify patterns and risk factors that may be difficult for humans to detect. With this clinicians can enable early diagnosis of diseases, leading to better treatment outcomes and improved patient care. In this research work, different types of machine learning methods are implemented and compared in terms of performance metrics to identify whether a person effected or not. The algorithms used here for predicting liver patients are Random Forest classifier, K-nearest neighbor, XGBoost, Decision tree, Logistic Regression, support vector machine, Extra Trees Classifier. The experimental results showed that the accuracy of various machine learning models-Random Forest classifier-67.4%, K-nearest neighbor-54.8%, XGBoost-72%, Decision tree-65.1%, Logistic Regression-68.0%, support vector machine-65.1%, Extra Trees Classifier-70.2% after applying Synthetic Minority Over-sampling technique.

Downloads

Download data is not yet available.

References

Arias, I.M.; Alter, H.J.; Boyer, J.L.; Cohen, D.E.; Shafritz, D.A.; Thorgeirsson, S.S.; Wolkoff, A.W. The Liver: Biology and Pathobiology;John Wiley & Sons: Hoboken, NJ, USA, 2020. DOI: https://doi.org/10.1002/9781119436812

Singh, H.R.; Rabi, S. Study of morphological variations of liver in human. Transl. Res. Anat. 2019, 14, 1–5. DOI: https://doi.org/10.1016/j.tria.2018.11.004

Razavi, H. Global epidemiology of viral hepatitis. Gastroenterol. Clin. 2020, 49, 179–189. DOI: https://doi.org/10.1016/j.gtc.2020.01.001

Seitz, H.K., Bataller, R., Cortez-Pinto, H. et al. Alcoholic liver disease. Nat Rev Dis Primers 4, 16 (2018). https://doi.org/10.1038/s41572-018-0014-7

Powell, E.E.;Wong, V.W.S.; Rinella, M. Non-alcoholic fatty liver disease. Lancet 2021, 397, 2212–2224. DOI: https://doi.org/10.1016/S0140-6736(20)32511-3

Ringehan, M.; McKeating, J.A.; Protzer, U. Viral hepatitis and liver cancer. Philos. Trans. R. Soc. B Biol. Sci. 2017, 372, 20160274. DOI: https://doi.org/10.1098/rstb.2016.0274

Smith, A.; Baumgartner, K.; Bositis, C. Cirrhosis: Diagnosis and management. Am. Fam. Physician 2019, 100, 759–770.

https://www.cdc.gov/hepatitis/hav/pdfs/HepAGeneralFactSheet.pdf

Yuen, M.F.; Chen, D.S.; Dusheiko, G.M.; Janssen, H.L.; Lau, D.T.; Locarnini, S.A.; Peters, M.G.; Lai, C.L. Hepatitis B virus infection.Nat. Rev. Dis. Prim. 2018, 4, 1–20. DOI: https://doi.org/10.1038/nrdp.2018.35

Manns, M.P.; Buti, M.; Gane, E.; Pawlotsky, J.M.; Razavi, H.; Terrault, N.; Younossi, Z. Hepatitis C virus infection. Nat. Rev. Dis.Prim. 2017, 3, 1–19. DOI: https://doi.org/10.1038/nrdp.2017.6

Mentha, N.; Clément, S.; Negro, F.; Alfaiate, D. A review on hepatitis D: From virology to new therapies. J. Adv. Res. 2019,17, 3–15. DOI: https://doi.org/10.1016/j.jare.2019.03.009

Kamar, N.; Izopet, J.; Pavio, N.; Aggarwal, R.; Labrique, A.;Wedemeyer, H.; Dalton, H.R. Hepatitis E virus infection. Nat. Rev.Dis. Prim. 2017, 3, 1–16. DOI: https://doi.org/10.1038/nrdp.2017.86

Marchesini, G.; Moscatiello, S.; Di Domizio, S.; Forlani, G. Obesity-associated liver disease. J. Clin. Endocrinol. Metab. 2008,93, s74–s80. DOI: https://doi.org/10.1210/jc.2008-1399

Seitz, H.K.; Bataller, R.; Cortez-Pinto, H.; Gao, B.; Gual, A.; Lackner, C.; Mathurin, P.; Mueller, S.; Szabo, G.; Tsukamoto, H.Alcoholic liver disease. Nat. Rev. Dis. Prim. 2018, 4, 1–22. DOI: https://doi.org/10.1038/s41572-018-0014-7

Åberg, F.; Färkkilä, M. Drinking and obesity: Alcoholic liver disease/nonalcoholic fatty liver disease interactions. In Seminars in Liver Disease; Thieme Medical Publishers: New York, NY, USA, 2020; Volume 40, pp. 154–162. DOI: https://doi.org/10.1055/s-0040-1701443

Bae, M.; Park, Y.K.; Lee, J.Y. Food components with antifibrotic activity and implications in prevention of liver disease. J. Nutr.Biochem. 2018, 55, 1–11. DOI: https://doi.org/10.1016/j.jnutbio.2017.11.003

Cai, J.; Zhang, X.J.; Li, H. Progress and challenges in the prevention and control of nonalcoholic fatty liver disease. Med. Res. Rev.2019, 39, 328–348. DOI: https://doi.org/10.1002/med.21515

Fazakis, N.; Kocsis, O.; Dritsas, E.; Alexiou, S.; Fakotakis, N.; Moustakas, K. Machine learning tools for long-term type 2 diabetes risk prediction. IEEE Access 2021, 9, 103737–103757. DOI: https://doi.org/10.1109/ACCESS.2021.3098691

Dritsas, E.; Trigka, M. Data-Driven Machine-Learning Methods for Diabetes Risk Prediction. Sensors 2022, 22, 5304. DOI: https://doi.org/10.3390/s22145304

Alexiou, S.; Dritsas, E.; Kocsis, O.; Moustakas, K.; Fakotakis, N. An approach for Personalized Continuous Glucose Prediction with Regression Trees. In Proceedings of the 2021 6th South-East Europe Design Automation, Computer Engineering, ComputerNetworks and Social Media Conference (SEEDA-CECNSM), Preveza, Greece, 24–26 September 2021; pp. 1–6. DOI: https://doi.org/10.1109/SEEDA-CECNSM53056.2021.9566278

Fazakis, N.; Dritsas, E.; Kocsis, O.; Fakotakis, N.; Moustakas, K. Long-Term Cholesterol Risk Prediction with Machine Learning Techniques in ELSA Database. In Proceedings of the 13th International Joint Conference on Computational Intelligence (IJCCI), Online, 24–26 October 2021; pp. 445–450. DOI: https://doi.org/10.5220/0010727200003063

Dritsas, E.; Fazakis, N.; Kocsis, O.; Fakotakis, N.; Moustakas, K. Long-Term Hypertension Risk Prediction with ML Techniques in ELSA Database. In Proceedings of the International Conference on Learning and Intelligent Optimization, Athens, Greece, 20–25 June 2021; pp. 113–120. DOI: https://doi.org/10.1007/978-3-030-92121-7_9

Dritsas, E.; Trigka, M. Machine Learning Methods for Hypercholesterolemia Long-Term Risk Prediction. Sensors 2022, 22, 5365. DOI: https://doi.org/10.3390/s22145365

Dritsas, E.; Alexiou, S.; Moustakas, K. COPD Severity Prediction in Elderly with ML Techniques. In Proceedings of the 15th International Conference on PErvasive Technologies Related to Assistive Environments, Corfu Island, Greece, 29 June–1 July 2022; pp. 185–189. DOI: https://doi.org/10.1145/3529190.3534748

Dritsas, E.; Trigka, M. Supervised Machine Learning Models to Identify Early-Stage Symptoms of SARS-CoV-2. Sensors 2023,3, 40. DOI: https://doi.org/10.3390/s23010040

Dritsas, E.; Trigka, M. Stroke Risk Prediction with Machine Learning Techniques.

Dritsas, E.; Trigka, M. Machine Learning Techniques for Chronic Kidney Disease Risk Prediction. Big Data Cogn. Comput. 2022, 6, 98. DOI: https://doi.org/10.3390/bdcc6030098

Dritsas, E.; Trigka, M. Lung Cancer Risk Prediction with Machine Learning Models. Big Data Cogn. Comput. 2022, 6, 1 DOI: https://doi.org/10.3390/bdcc6040139

Konstantoulas, I.; Kocsis, O.; Dritsas, E.; Fakotakis, N.; Moustakas, K. Sleep Quality Monitoring with Human Assisted Corrections. In Proceedings of the International Joint Conference on Computational Intelligence (IJCCI), Online, 24–26 October 2021; pp. 435–444. DOI: https://doi.org/10.5220/0010727100003063

Dritsas, E.; Alexiou, S.; Moustakas, K. Cardiovascular Disease Risk Prediction with Supervised Machine Learning Techniques. In Proceedings of the ICT4AWE, Online, 23–25 April 2022; pp. 315–321.

Indian Liver Patient Records. Available online: https://www.kaggle.com/datasets/uciml/indian-liver-patient-records (accessed on 14 November 2022).

Lin, H.; Yip, T.C.F.; Zhang, X.; Li, G.; Tse, Y.K.; Hui, V.W.K.; Liang, L.Y.; Lai, J.C.T.; Chan, S.L.; Chan, H.L.Y.; et al. Age and the relative importance of liver-related deaths in nonalcoholic fatty liver disease. Hepatology 2022. DOI: https://doi.org/10.1016/S0168-8278(22)00708-5

Mauvais-Jarvis, F.; Merz, N.B.; Barnes, P.J.; Brinton, R.D.; Carrero, J.J.; DeMeo, D.L.; De Vries, G.J.; Epperson, C.N.; Govindan, R.; Klein, S.L.; et al. Sex and gender: Modifiers of health, disease, and medicine. Lancet 2020, 396, 565–582. DOI: https://doi.org/10.1016/S0140-6736(20)31561-0

Ruiz, A.R.G.; Crespo, J.; Martínez, R.M.L.; Iruzubieta, P.; Mercadal, G.C.; Garcés, M.L.; Lavin, B.; Ruiz, M.M. Measurement, and clinical usefulness of bilirubin in liver disease. Adv. Lab. Med. Med. Lab. 2021, 2, 352–361. DOI: https://doi.org/10.1515/almed-2021-0047

Liu, Y.; Cavallaro, P.M.; Kim, B.M.; Liu, T.; Wang, H.; Kühn, F.; Adiliaghdam, F.; Liu, E.; Vasan, R.; Samarbafzadeh, E.; et al. A role for intestinal alkaline phosphatase in preventing liver fibrosis. Theranostics 2021, 11, 14. DOI: https://doi.org/10.7150/thno.48468

Goodarzi, R.; Sabzian, K.; Shishehbor, F.; Mansoori, A. Does turmeric/curcumin supplementation improve serum alanine aminotransferase and aspartate aminotransferase levels in patients with nonalcoholic fatty liver disease? A systematic review and meta-analysis of randomized controlled trials. Phytother. Res. 2019, 33, 561–570. DOI: https://doi.org/10.1002/ptr.6270

He, B.; Shi, J.;Wang, X.; Jiang, H.; Zhu, H.J. Genome-wide pQTL analysis of protein expression regulatory networks in the human liver. BMC Biol. 2020, 18, 1–16. DOI: https://doi.org/10.1186/s12915-020-00830-3

Carvalho, J.R.; Machado, M.V. New insights about albumin and liver disease. Ann. Hepatol. 2018, 17, 547–560. DOI: https://doi.org/10.5604/01.3001.0012.0916

Ye, Y.; Chen, W.; Gu, M.; Xian, G.; Pan, B.; Zheng, L.; Zhang, Z.; Sheng, P. Serum globulin and albumin to globulin ratio as potential diagnostic biomarkers for periprosthetic joint infection: A retrospective review. J. Orthop. Surg. Res. 2020, 15, 1–7. DOI: https://doi.org/10.1186/s13018-020-01959-1

Indian liver Patient Records. available online: https://github.com/aashitaarora/Classification-LiverDiseaseDataset-/blob/master/submission1_test.csv

Maldonado, S.; López, J.; Vairetti, C. An alternative SMOTE oversampling strategy for high-dimensional datasets. Appl. Soft Comput. 2019, 76, 380–389. DOI: https://doi.org/10.1016/j.asoc.2018.12.024

Dritsas, E.; Fazakis, N.; Kocsis, O.; Moustakas, K.; Fakotakis, N. Optimal Team Pairing of Elder Office Employees with Machine Learning on Synthetic Data. In Proceedings of the 2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA), Chania Crete, Greece, 12–14 July 2021; pp. 1–4. DOI: https://doi.org/10.1109/IISA52424.2021.9555511

Jain, D.; Singh, V. Attribute selection and classification systems for chronic disease prediction: A review. Egypt. Inform. J. 2018, 19, 179–189. DOI: https://doi.org/10.1016/j.eij.2018.03.002

Nusinovici, S.; Tham, Y.C.; Yan, M.Y.C.; Ting, D.S.W.; Li, J.; Sabanayagam, C.; Wong, T.Y.; Cheng, C.Y. Logistic regression was as good as machine learning for predicting major chronic diseases. J. Clin. Epidemiol. 2020, 122, 56–69. DOI: https://doi.org/10.1016/j.jclinepi.2020.03.002

Ghosh, S.; Dasgupta, A.; Swetapadma, A. A study on support vector machine based linear and non-linear pattern classification. In Proceedings of the 2019 International Conference on Intelligent Sustainable Systems (ICISS), Palladam, India, 21–22 February 2019; pp. 24–28. DOI: https://doi.org/10.1109/ISS1.2019.8908018

Nahar, Nazmun, and Ferdous Ara. "Liver disease prediction by using different decision tree techniques." International Journal of Data Mining & Knowledge Management Process 8.2 (2018): 01-09. DOI: https://doi.org/10.5121/ijdkp.2018.8201

Cunningham, P.; Delany, S.J. k-Nearest neighbour classifiers-A Tutorial. ACM Comput. Surv. (CSUR) 2021, 54, 1–25. DOI: https://doi.org/10.1145/3459665

Dong, X.; Yu, Z.; Cao,W.; Shi, Y.; Ma, Q. A survey on ensemble learning. Front. Comput. Sci. 2020, 14, 241–258. DOI: https://doi.org/10.1007/s11704-019-8208-z

Palimkar, P.; Shaw, R.N.; Ghosh, A. Machine learning technique to prognosis diabetes disease: RRandom Forest classifier approach. In Advanced Computing and Intelligent Technologies; Springer: Berlin/Heidelberg, Germany, 2022; pp. 219–244. DOI: https://doi.org/10.1007/978-981-16-2164-2_19

González, S.; García, S.; Del Ser, J.; Rokach, L.; Herrera, F. A practical tutorial on bagging and boosting based ensembles for machine learning: Algorithms, software tools, performance study, practical perspectives, and opportunities. Inf. Fusion 2020, 64, 205–237. DOI: https://doi.org/10.1016/j.inffus.2020.07.007

Handelman, G.S.; Kok, H.K.; Chandra, R.V.; Razavi, A.H.; Huang, S.; Brooks, M.; Lee, M.J.; Asadi, H. Peering into the black box of artificial intelligence: Evaluation metrics of machine learning methods. Am. J. Roentgenol. 2019, 212, 38–43. DOI: https://doi.org/10.2214/AJR.18.20224

Zhou, J.; Gandomi, A.H.; Chen, F.; Holzinger, A. Evaluating the quality of machine learning explanations: A survey on methods and metrics. Electronics 2021, 10, 593. DOI: https://doi.org/10.3390/electronics10050593

Swapna, K.; Prasad Babu, M. Critical analysis of Indian liver patient’s dataset using ANOVA method. Int. J. Eng. Technol 2017,7, 19–33.

Gulia, A.; Vohra, R.; Rani, P. Liver patient classification using intelligent techniques. Int. J. Comput. Sci. Inf. Technol. 2014,5, 5110–5115.

Kumar, P.; Thakur, R.S. Early detection of the liver disorder from imbalance liver function test datasets. Int. J. Innov. Technol.Explor. Eng. 2019, 8, 179–186.

Jin, H.; Kim, S.; Kim, J. Decision factors on effective liver patient data prediction. Int. J. Bio-Sci. Bio-Technol. 2014, 6, 167–178. DOI: https://doi.org/10.14257/ijbsbt.2014.6.4.16

Rahman, A.S.; Shamrat, F.J.M.; Tasnim, Z.; Roy, J.; Hossain, S.A. A comparative study on liver disease prediction using supervised machine learning algorithms. Int. J. Sci. Technol. Res. 2019, 8, 419–422.

M. Abdar, N. Y. Yen, and J. C.-S. Hung, "Improving the Diagnosis of Liver Disease Using Multilayer Perceptron Neural Network and Boosted Decision Trees," Journal of Medical and Biological Engineering, pp. 1-13, 2017. DOI: https://doi.org/10.1007/s40846-017-0360-z

Geetha, C.; Arunachalam, A. Evaluation based Approaches for Liver Disease Prediction using Machine Learning Algorithms. In Proceedings of the 2021 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, India,27–29 January 2021; pp. 1–4. DOI: https://doi.org/10.1109/ICCCI50826.2021.9402463

H. He and E. A. Garcia, "Learning from imbalanced data," IEEE Transactions on knowledge and data engineering, vol. 21, no. 9, pp. 1263-1284, 2009. DOI: https://doi.org/10.1109/TKDE.2008.239

Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP: SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 2002, 16:341–378. DOI: https://doi.org/10.1613/jair.953

Dritsas E, Trigka M. Supervised Machine Learning Models for Liver Disease Risk Prediction. Computers. 2023;12(1):19. https://doi.org/10.3390/computers12010019. DOI: https://doi.org/10.3390/computers12010019

I. Hanif and M. M. Khan, "Liver Cirrhosis Prediction using Machine Learning Approaches," 2022 IEEE 13th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), New York, NY, NY, USA, 2022, pp. 0028-0034, doi: 10.1109/UEMCON54665.2022.9965718. DOI: https://doi.org/10.1109/UEMCON54665.2022.9965718

Sachdeva, R.K., Bathla, P., Rani, P. et al. A systematic method for diagnosis of hepatitis disease using machine learning. Innovations Syst Softw Eng 19, 71–80 (2023). https://doi.org/10.1007/s11334-022-00509-8. DOI: https://doi.org/10.1007/s11334-022-00509-8

H. S. Yadav and R. K. Singhal, "Classification and Prediction of Liver Disease Diagnosis Using Machine Learning Algorithms," 2023 2nd International Conference for Innovation in Technology (INOCON), Bangalore, India, 2023, pp.1-6, doi: 10.1109/INOCON57975.2023.10101221. DOI: https://doi.org/10.1109/INOCON57975.2023.10101221

Downloads

Published

13-11-2023

How to Cite

1.
Makkena KR, Natarajan K. Classification Algorithms for Liver Epidemic Identification. EAI Endorsed Trans Perv Health Tech [Internet]. 2023 Nov. 13 [cited 2024 Dec. 26];9. Available from: https://publications.eai.eu/index.php/phat/article/view/4379