Classification Algorithms for Liver Epidemic Identification
DOI:
https://doi.org/10.4108/eetpht.9.4379Keywords:
medical care, liver epidemic, prognosis, classification models, Sythetic Minority Over Sampling TechniqueAbstract
Situated in the upper right region of the abdomen, beneath the diaphragm and above the stomach, lies the liver. It is a crucial organ essential for the proper functioning of the body. The principal tasks are to eliminate generated waste produced by our organs, and digestive food and preserve vitamins and energy materials. It performs many important functions in the body, it regulates the balance of hormones in the body filtering and removing bacteria, viruses, and other harmful substances from the blood. In certain dire circumstances, the outcome can unfortunately result in fatality. There exist numerous classifications of liver diseases, based on their causes or distinguishing characteristics. Some common categories of liver disease include Viral hepatitis, Autoimmune liver disease, Metabolic liver disease, Alcohol-related liver disease, Non-alcoholic fatty liver disease, Genetic liver disease, Drug-induced liver injury, Biliary tract disorders. Machine learning algorithms can help identify patterns and risk factors that may be difficult for humans to detect. With this clinicians can enable early diagnosis of diseases, leading to better treatment outcomes and improved patient care. In this research work, different types of machine learning methods are implemented and compared in terms of performance metrics to identify whether a person effected or not. The algorithms used here for predicting liver patients are Random Forest classifier, K-nearest neighbor, XGBoost, Decision tree, Logistic Regression, support vector machine, Extra Trees Classifier. The experimental results showed that the accuracy of various machine learning models-Random Forest classifier-67.4%, K-nearest neighbor-54.8%, XGBoost-72%, Decision tree-65.1%, Logistic Regression-68.0%, support vector machine-65.1%, Extra Trees Classifier-70.2% after applying Synthetic Minority Over-sampling technique.
Downloads
References
Arias, I.M.; Alter, H.J.; Boyer, J.L.; Cohen, D.E.; Shafritz, D.A.; Thorgeirsson, S.S.; Wolkoff, A.W. The Liver: Biology and Pathobiology;John Wiley & Sons: Hoboken, NJ, USA, 2020. DOI: https://doi.org/10.1002/9781119436812
Singh, H.R.; Rabi, S. Study of morphological variations of liver in human. Transl. Res. Anat. 2019, 14, 1–5. DOI: https://doi.org/10.1016/j.tria.2018.11.004
Razavi, H. Global epidemiology of viral hepatitis. Gastroenterol. Clin. 2020, 49, 179–189. DOI: https://doi.org/10.1016/j.gtc.2020.01.001
Seitz, H.K., Bataller, R., Cortez-Pinto, H. et al. Alcoholic liver disease. Nat Rev Dis Primers 4, 16 (2018). https://doi.org/10.1038/s41572-018-0014-7
Powell, E.E.;Wong, V.W.S.; Rinella, M. Non-alcoholic fatty liver disease. Lancet 2021, 397, 2212–2224. DOI: https://doi.org/10.1016/S0140-6736(20)32511-3
Ringehan, M.; McKeating, J.A.; Protzer, U. Viral hepatitis and liver cancer. Philos. Trans. R. Soc. B Biol. Sci. 2017, 372, 20160274. DOI: https://doi.org/10.1098/rstb.2016.0274
Smith, A.; Baumgartner, K.; Bositis, C. Cirrhosis: Diagnosis and management. Am. Fam. Physician 2019, 100, 759–770.
https://www.cdc.gov/hepatitis/hav/pdfs/HepAGeneralFactSheet.pdf
Yuen, M.F.; Chen, D.S.; Dusheiko, G.M.; Janssen, H.L.; Lau, D.T.; Locarnini, S.A.; Peters, M.G.; Lai, C.L. Hepatitis B virus infection.Nat. Rev. Dis. Prim. 2018, 4, 1–20. DOI: https://doi.org/10.1038/nrdp.2018.35
Manns, M.P.; Buti, M.; Gane, E.; Pawlotsky, J.M.; Razavi, H.; Terrault, N.; Younossi, Z. Hepatitis C virus infection. Nat. Rev. Dis.Prim. 2017, 3, 1–19. DOI: https://doi.org/10.1038/nrdp.2017.6
Mentha, N.; Clément, S.; Negro, F.; Alfaiate, D. A review on hepatitis D: From virology to new therapies. J. Adv. Res. 2019,17, 3–15. DOI: https://doi.org/10.1016/j.jare.2019.03.009
Kamar, N.; Izopet, J.; Pavio, N.; Aggarwal, R.; Labrique, A.;Wedemeyer, H.; Dalton, H.R. Hepatitis E virus infection. Nat. Rev.Dis. Prim. 2017, 3, 1–16. DOI: https://doi.org/10.1038/nrdp.2017.86
Marchesini, G.; Moscatiello, S.; Di Domizio, S.; Forlani, G. Obesity-associated liver disease. J. Clin. Endocrinol. Metab. 2008,93, s74–s80. DOI: https://doi.org/10.1210/jc.2008-1399
Seitz, H.K.; Bataller, R.; Cortez-Pinto, H.; Gao, B.; Gual, A.; Lackner, C.; Mathurin, P.; Mueller, S.; Szabo, G.; Tsukamoto, H.Alcoholic liver disease. Nat. Rev. Dis. Prim. 2018, 4, 1–22. DOI: https://doi.org/10.1038/s41572-018-0014-7
Åberg, F.; Färkkilä, M. Drinking and obesity: Alcoholic liver disease/nonalcoholic fatty liver disease interactions. In Seminars in Liver Disease; Thieme Medical Publishers: New York, NY, USA, 2020; Volume 40, pp. 154–162. DOI: https://doi.org/10.1055/s-0040-1701443
Bae, M.; Park, Y.K.; Lee, J.Y. Food components with antifibrotic activity and implications in prevention of liver disease. J. Nutr.Biochem. 2018, 55, 1–11. DOI: https://doi.org/10.1016/j.jnutbio.2017.11.003
Cai, J.; Zhang, X.J.; Li, H. Progress and challenges in the prevention and control of nonalcoholic fatty liver disease. Med. Res. Rev.2019, 39, 328–348. DOI: https://doi.org/10.1002/med.21515
Fazakis, N.; Kocsis, O.; Dritsas, E.; Alexiou, S.; Fakotakis, N.; Moustakas, K. Machine learning tools for long-term type 2 diabetes risk prediction. IEEE Access 2021, 9, 103737–103757. DOI: https://doi.org/10.1109/ACCESS.2021.3098691
Dritsas, E.; Trigka, M. Data-Driven Machine-Learning Methods for Diabetes Risk Prediction. Sensors 2022, 22, 5304. DOI: https://doi.org/10.3390/s22145304
Alexiou, S.; Dritsas, E.; Kocsis, O.; Moustakas, K.; Fakotakis, N. An approach for Personalized Continuous Glucose Prediction with Regression Trees. In Proceedings of the 2021 6th South-East Europe Design Automation, Computer Engineering, ComputerNetworks and Social Media Conference (SEEDA-CECNSM), Preveza, Greece, 24–26 September 2021; pp. 1–6. DOI: https://doi.org/10.1109/SEEDA-CECNSM53056.2021.9566278
Fazakis, N.; Dritsas, E.; Kocsis, O.; Fakotakis, N.; Moustakas, K. Long-Term Cholesterol Risk Prediction with Machine Learning Techniques in ELSA Database. In Proceedings of the 13th International Joint Conference on Computational Intelligence (IJCCI), Online, 24–26 October 2021; pp. 445–450. DOI: https://doi.org/10.5220/0010727200003063
Dritsas, E.; Fazakis, N.; Kocsis, O.; Fakotakis, N.; Moustakas, K. Long-Term Hypertension Risk Prediction with ML Techniques in ELSA Database. In Proceedings of the International Conference on Learning and Intelligent Optimization, Athens, Greece, 20–25 June 2021; pp. 113–120. DOI: https://doi.org/10.1007/978-3-030-92121-7_9
Dritsas, E.; Trigka, M. Machine Learning Methods for Hypercholesterolemia Long-Term Risk Prediction. Sensors 2022, 22, 5365. DOI: https://doi.org/10.3390/s22145365
Dritsas, E.; Alexiou, S.; Moustakas, K. COPD Severity Prediction in Elderly with ML Techniques. In Proceedings of the 15th International Conference on PErvasive Technologies Related to Assistive Environments, Corfu Island, Greece, 29 June–1 July 2022; pp. 185–189. DOI: https://doi.org/10.1145/3529190.3534748
Dritsas, E.; Trigka, M. Supervised Machine Learning Models to Identify Early-Stage Symptoms of SARS-CoV-2. Sensors 2023,3, 40. DOI: https://doi.org/10.3390/s23010040
Dritsas, E.; Trigka, M. Stroke Risk Prediction with Machine Learning Techniques.
Dritsas, E.; Trigka, M. Machine Learning Techniques for Chronic Kidney Disease Risk Prediction. Big Data Cogn. Comput. 2022, 6, 98. DOI: https://doi.org/10.3390/bdcc6030098
Dritsas, E.; Trigka, M. Lung Cancer Risk Prediction with Machine Learning Models. Big Data Cogn. Comput. 2022, 6, 1 DOI: https://doi.org/10.3390/bdcc6040139
Konstantoulas, I.; Kocsis, O.; Dritsas, E.; Fakotakis, N.; Moustakas, K. Sleep Quality Monitoring with Human Assisted Corrections. In Proceedings of the International Joint Conference on Computational Intelligence (IJCCI), Online, 24–26 October 2021; pp. 435–444. DOI: https://doi.org/10.5220/0010727100003063
Dritsas, E.; Alexiou, S.; Moustakas, K. Cardiovascular Disease Risk Prediction with Supervised Machine Learning Techniques. In Proceedings of the ICT4AWE, Online, 23–25 April 2022; pp. 315–321.
Indian Liver Patient Records. Available online: https://www.kaggle.com/datasets/uciml/indian-liver-patient-records (accessed on 14 November 2022).
Lin, H.; Yip, T.C.F.; Zhang, X.; Li, G.; Tse, Y.K.; Hui, V.W.K.; Liang, L.Y.; Lai, J.C.T.; Chan, S.L.; Chan, H.L.Y.; et al. Age and the relative importance of liver-related deaths in nonalcoholic fatty liver disease. Hepatology 2022. DOI: https://doi.org/10.1016/S0168-8278(22)00708-5
Mauvais-Jarvis, F.; Merz, N.B.; Barnes, P.J.; Brinton, R.D.; Carrero, J.J.; DeMeo, D.L.; De Vries, G.J.; Epperson, C.N.; Govindan, R.; Klein, S.L.; et al. Sex and gender: Modifiers of health, disease, and medicine. Lancet 2020, 396, 565–582. DOI: https://doi.org/10.1016/S0140-6736(20)31561-0
Ruiz, A.R.G.; Crespo, J.; Martínez, R.M.L.; Iruzubieta, P.; Mercadal, G.C.; Garcés, M.L.; Lavin, B.; Ruiz, M.M. Measurement, and clinical usefulness of bilirubin in liver disease. Adv. Lab. Med. Med. Lab. 2021, 2, 352–361. DOI: https://doi.org/10.1515/almed-2021-0047
Liu, Y.; Cavallaro, P.M.; Kim, B.M.; Liu, T.; Wang, H.; Kühn, F.; Adiliaghdam, F.; Liu, E.; Vasan, R.; Samarbafzadeh, E.; et al. A role for intestinal alkaline phosphatase in preventing liver fibrosis. Theranostics 2021, 11, 14. DOI: https://doi.org/10.7150/thno.48468
Goodarzi, R.; Sabzian, K.; Shishehbor, F.; Mansoori, A. Does turmeric/curcumin supplementation improve serum alanine aminotransferase and aspartate aminotransferase levels in patients with nonalcoholic fatty liver disease? A systematic review and meta-analysis of randomized controlled trials. Phytother. Res. 2019, 33, 561–570. DOI: https://doi.org/10.1002/ptr.6270
He, B.; Shi, J.;Wang, X.; Jiang, H.; Zhu, H.J. Genome-wide pQTL analysis of protein expression regulatory networks in the human liver. BMC Biol. 2020, 18, 1–16. DOI: https://doi.org/10.1186/s12915-020-00830-3
Carvalho, J.R.; Machado, M.V. New insights about albumin and liver disease. Ann. Hepatol. 2018, 17, 547–560. DOI: https://doi.org/10.5604/01.3001.0012.0916
Ye, Y.; Chen, W.; Gu, M.; Xian, G.; Pan, B.; Zheng, L.; Zhang, Z.; Sheng, P. Serum globulin and albumin to globulin ratio as potential diagnostic biomarkers for periprosthetic joint infection: A retrospective review. J. Orthop. Surg. Res. 2020, 15, 1–7. DOI: https://doi.org/10.1186/s13018-020-01959-1
Indian liver Patient Records. available online: https://github.com/aashitaarora/Classification-LiverDiseaseDataset-/blob/master/submission1_test.csv
Maldonado, S.; López, J.; Vairetti, C. An alternative SMOTE oversampling strategy for high-dimensional datasets. Appl. Soft Comput. 2019, 76, 380–389. DOI: https://doi.org/10.1016/j.asoc.2018.12.024
Dritsas, E.; Fazakis, N.; Kocsis, O.; Moustakas, K.; Fakotakis, N. Optimal Team Pairing of Elder Office Employees with Machine Learning on Synthetic Data. In Proceedings of the 2021 12th International Conference on Information, Intelligence, Systems & Applications (IISA), Chania Crete, Greece, 12–14 July 2021; pp. 1–4. DOI: https://doi.org/10.1109/IISA52424.2021.9555511
Jain, D.; Singh, V. Attribute selection and classification systems for chronic disease prediction: A review. Egypt. Inform. J. 2018, 19, 179–189. DOI: https://doi.org/10.1016/j.eij.2018.03.002
Nusinovici, S.; Tham, Y.C.; Yan, M.Y.C.; Ting, D.S.W.; Li, J.; Sabanayagam, C.; Wong, T.Y.; Cheng, C.Y. Logistic regression was as good as machine learning for predicting major chronic diseases. J. Clin. Epidemiol. 2020, 122, 56–69. DOI: https://doi.org/10.1016/j.jclinepi.2020.03.002
Ghosh, S.; Dasgupta, A.; Swetapadma, A. A study on support vector machine based linear and non-linear pattern classification. In Proceedings of the 2019 International Conference on Intelligent Sustainable Systems (ICISS), Palladam, India, 21–22 February 2019; pp. 24–28. DOI: https://doi.org/10.1109/ISS1.2019.8908018
Nahar, Nazmun, and Ferdous Ara. "Liver disease prediction by using different decision tree techniques." International Journal of Data Mining & Knowledge Management Process 8.2 (2018): 01-09. DOI: https://doi.org/10.5121/ijdkp.2018.8201
Cunningham, P.; Delany, S.J. k-Nearest neighbour classifiers-A Tutorial. ACM Comput. Surv. (CSUR) 2021, 54, 1–25. DOI: https://doi.org/10.1145/3459665
Dong, X.; Yu, Z.; Cao,W.; Shi, Y.; Ma, Q. A survey on ensemble learning. Front. Comput. Sci. 2020, 14, 241–258. DOI: https://doi.org/10.1007/s11704-019-8208-z
Palimkar, P.; Shaw, R.N.; Ghosh, A. Machine learning technique to prognosis diabetes disease: RRandom Forest classifier approach. In Advanced Computing and Intelligent Technologies; Springer: Berlin/Heidelberg, Germany, 2022; pp. 219–244. DOI: https://doi.org/10.1007/978-981-16-2164-2_19
González, S.; García, S.; Del Ser, J.; Rokach, L.; Herrera, F. A practical tutorial on bagging and boosting based ensembles for machine learning: Algorithms, software tools, performance study, practical perspectives, and opportunities. Inf. Fusion 2020, 64, 205–237. DOI: https://doi.org/10.1016/j.inffus.2020.07.007
Handelman, G.S.; Kok, H.K.; Chandra, R.V.; Razavi, A.H.; Huang, S.; Brooks, M.; Lee, M.J.; Asadi, H. Peering into the black box of artificial intelligence: Evaluation metrics of machine learning methods. Am. J. Roentgenol. 2019, 212, 38–43. DOI: https://doi.org/10.2214/AJR.18.20224
Zhou, J.; Gandomi, A.H.; Chen, F.; Holzinger, A. Evaluating the quality of machine learning explanations: A survey on methods and metrics. Electronics 2021, 10, 593. DOI: https://doi.org/10.3390/electronics10050593
Swapna, K.; Prasad Babu, M. Critical analysis of Indian liver patient’s dataset using ANOVA method. Int. J. Eng. Technol 2017,7, 19–33.
Gulia, A.; Vohra, R.; Rani, P. Liver patient classification using intelligent techniques. Int. J. Comput. Sci. Inf. Technol. 2014,5, 5110–5115.
Kumar, P.; Thakur, R.S. Early detection of the liver disorder from imbalance liver function test datasets. Int. J. Innov. Technol.Explor. Eng. 2019, 8, 179–186.
Jin, H.; Kim, S.; Kim, J. Decision factors on effective liver patient data prediction. Int. J. Bio-Sci. Bio-Technol. 2014, 6, 167–178. DOI: https://doi.org/10.14257/ijbsbt.2014.6.4.16
Rahman, A.S.; Shamrat, F.J.M.; Tasnim, Z.; Roy, J.; Hossain, S.A. A comparative study on liver disease prediction using supervised machine learning algorithms. Int. J. Sci. Technol. Res. 2019, 8, 419–422.
M. Abdar, N. Y. Yen, and J. C.-S. Hung, "Improving the Diagnosis of Liver Disease Using Multilayer Perceptron Neural Network and Boosted Decision Trees," Journal of Medical and Biological Engineering, pp. 1-13, 2017. DOI: https://doi.org/10.1007/s40846-017-0360-z
Geetha, C.; Arunachalam, A. Evaluation based Approaches for Liver Disease Prediction using Machine Learning Algorithms. In Proceedings of the 2021 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, India,27–29 January 2021; pp. 1–4. DOI: https://doi.org/10.1109/ICCCI50826.2021.9402463
H. He and E. A. Garcia, "Learning from imbalanced data," IEEE Transactions on knowledge and data engineering, vol. 21, no. 9, pp. 1263-1284, 2009. DOI: https://doi.org/10.1109/TKDE.2008.239
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP: SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 2002, 16:341–378. DOI: https://doi.org/10.1613/jair.953
Dritsas E, Trigka M. Supervised Machine Learning Models for Liver Disease Risk Prediction. Computers. 2023;12(1):19. https://doi.org/10.3390/computers12010019. DOI: https://doi.org/10.3390/computers12010019
I. Hanif and M. M. Khan, "Liver Cirrhosis Prediction using Machine Learning Approaches," 2022 IEEE 13th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), New York, NY, NY, USA, 2022, pp. 0028-0034, doi: 10.1109/UEMCON54665.2022.9965718. DOI: https://doi.org/10.1109/UEMCON54665.2022.9965718
Sachdeva, R.K., Bathla, P., Rani, P. et al. A systematic method for diagnosis of hepatitis disease using machine learning. Innovations Syst Softw Eng 19, 71–80 (2023). https://doi.org/10.1007/s11334-022-00509-8. DOI: https://doi.org/10.1007/s11334-022-00509-8
H. S. Yadav and R. K. Singhal, "Classification and Prediction of Liver Disease Diagnosis Using Machine Learning Algorithms," 2023 2nd International Conference for Innovation in Technology (INOCON), Bangalore, India, 2023, pp.1-6, doi: 10.1109/INOCON57975.2023.10101221. DOI: https://doi.org/10.1109/INOCON57975.2023.10101221
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2023 Koteswara Rao Makkena, Karthika Natarajan
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.