A predictive prototype for the identification of diseases relied on the symptoms described by patients


  • Suvendu Kumar Nayak Centurion University of Technology and Management image/svg+xml
  • Mamata Garanayak KISS University
  • Sangram Keshari Swain Centurion University of Technology and Management image/svg+xml




Prediction of disease, k-means, Random forest, Multinomial linear regression, CART prototype, KNN


INTRODUCTION: A thorough and timely investigation of any health-related problem is essential for disease prevention and treatment. The normal way of diagnosis may not be sufficient in the event of a serious illness problem.

OBJECTIVE: Creating a medical diagnosis prototype that uses many machine learning processes to forecast any illness relied on symptoms explained by patients can lead to an errorless diagnosis as compared to the traditional ways.

METHODS: We created a disease prediction prototype using ML techniques such as random forest, CART, multinomial linear regression, and KNN. The data set utilized for processing contained over 132 illnesses. Diagnosis algorithm outcomes the ailment that the person may be suffering from relied on the symptoms provided by the patients.

RESULTS: When compared to CART and random forest (accuracy is 97.72%, multinomial linear regression and KNN produced the best outcomes. The accuracy of the KNN prediction and multinomial linear regression techniques was 98.76%.

CONCLUSION: The diagnostic prototype can function as a doctor in the early detection of an illness, ensuring that medical care can begin in an appropriate time and many lives can be secured.


Download data is not yet available.


Jia, Z., Zeng, X., Duan, H., Lu, X., & Li, H. A patient-similarity-based model for diagnostic prediction. International journal of medical informatics, 135, 104073 (2020). DOI: https://doi.org/10.1016/j.ijmedinf.2019.104073

Lynch, C. J., & Liston, C. New machine-learning technologies for computer-aided diagnosis. Nature medicine, 24(9), 1304-1305 (2018). DOI: https://doi.org/10.1038/s41591-018-0178-4

Hashem, A. M., Rasmy, M. E. M., Wahba, K. M., & Shaker, O. G. Prediction of the degree of liver fibrosis using different pattern recognition techniques. In 2010 5th Cairo International Biomedical Engineering Conference (pp. 210- 214). IEEE (2010, December). DOI: https://doi.org/10.1109/CIBEC.2010.5716043

Singh, A., & Pandey, B. (2016). Diagnosis of liver disease using correlation distance metric based k-nearest neighbor approach. In Intelligent Systems Technologies and Applications 2016 (pp. 845-856). Springer International Publishing DOI: https://doi.org/10.1007/978-3-319-47952-1_67

Parimbelli, E., Marini, S., Sacchi, L., & Bellazzi, R. (2018). Patient similarity for precision medicine: A systematic review. Journal of biomedical informatics, 83, 87-96. DOI: https://doi.org/10.1016/j.jbi.2018.06.001

Sartakhti, J. S., Zangooei, M. H., & Mozafari, K. (2012). Hepatitis disease diagnosis using a novel hybrid method based on support vector machine and simulated annealing (SVM-SA). Computer methods and programs in biomedicine, 108(2), 570-579.. DOI: https://doi.org/10.1016/j.cmpb.2011.08.003

Olsen, C. R., Mentz, R. J., Anstrom, K. J., Page, D., & Patel, P. A. (2020). Clinical applications of machine learning in the diagnosis, classification, and prediction of heart failure. American Heart Journal, 229, 1-17. DOI: https://doi.org/10.1016/j.ahj.2020.07.009

Macedo Hair, G., Fonseca Nobre, F., & Brasil, P. (2019). Characterization of clinical patterns of dengue patients using an unsupervised machine learning approach. BMC infectious diseases, 19, 1-11. DOI: https://doi.org/10.1186/s12879-019-4282-y

Saranya, G., & Pravin, A. (2020). A comprehensive study on disease risk predictions in machine learning. International Journal of Electrical and Computer Engineering, 10(4), 4217. DOI: https://doi.org/10.11591/ijece.v10i4.pp4217-4225

Razavian, N., & Sontag, D. (2015). Temporal convolutional neural networks for diagnosis from lab tests. arXiv preprint arXiv:1511.07938.

Nithya, B., & Ilango, V. (2017, June). Predictive analytics in health care using machine learning tools and techniques. In 2017 International Conference on Intelligent Computing and Control Systems (ICICCS) (pp. 492-499). IEEE. DOI: https://doi.org/10.1109/ICCONS.2017.8250771

Shamir, R. R., Dolber, T., Noecker, A. M., Walter, B. L., & McIntyre, C. C. (2015). Machine learning approach to optimizing combined stimulation and medication therapies for Parkinson's disease. Brain stimulation, 8(6), 1025-1032. DOI: https://doi.org/10.1016/j.brs.2015.06.003

Jin, Z., Sun, Y., & Cheng, A. C. (2009, September). Predicting cardiovascular disease from real-time electrocardiographic monitoring: An adaptive machine learning approach on a cell phone. In 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society (pp. 6889-6892). IEEE.

Deepthi, Y., Kalyan, K. P., Vyas, M., Radhika, K., Babu, D. K., & Krishna Rao, N. V. (2020). Disease prediction based on symptoms using machine learning. In Energy Systems, Drives and Automations: Proceedings of ESDA 2019 (pp. 561-569). Singapore: Springer Singapore. DOI: https://doi.org/10.1007/978-981-15-5089-8_55

Kanchan, B. D., & Kishor, M. M. (2016, December). Study of machine learning algorithms for special disease prediction using principal of component analysis. In 2016 international conference on global trends in signal processing, information computing and communication (ICGTSPICC) (pp. 5-10). IEEE. DOI: https://doi.org/10.1109/ICGTSPICC.2016.7955260

Singh, A. S., Irfan, M., & Chowdhury, A. (2018, December). Prediction of liver disease using classification algorithms. In 2018 4th international conference on computing communication and automation (ICCCA) (pp. 1-3). IEEE

Grampurohit, S., & Sagarnal, C. (2020, June). Disease prediction using machine learning algorithms. In 2020 International Conference for Emerging Technology (INCET) (pp. 1-7). IEEE. DOI: https://doi.org/10.1109/INCET49848.2020.9154130

Hamsagayathri, P., & Vigneshwaran, S. (2021, February). Symptoms based disease prediction using machine learning techniques. In 2021 Third international conference on intelligent communication technologies and virtual mobile networks (ICICV) (pp. 747-752). IEEE. DOI: https://doi.org/10.1109/ICICV50876.2021.9388603

Dahiwade, Dhiraj, Gajanan Patle, and Ektaa Meshram. "Designing disease prediction model using machine learning approach." In 2019 3rd International Conference on Computing Methodologies and Communication (ICCMC), pp. 1211-1215. IEEE, 2019. DOI: https://doi.org/10.1109/ICCMC.2019.8819782

Alexander, N., Alexander, D. C., Barkhof, F., & Denaxas, S. (2021). Identifying and evaluating clinical subtypes of Alzheimer’s disease in care electronic health records using unsupervised machine learning. BMC Medical Informatics and Decision Making, 21(1), 1-13. DOI: https://doi.org/10.1186/s12911-021-01693-6

Garanayak, M., Nayak, S. K., Sangeetha, K., Choudhury, T., & Shitharth, S. (2022). Content and Popularity-Based Music Recommendation System. International Journal of Information System Modeling and Design (IJISMD), 13(7), 1-14. DOI: https://doi.org/10.4018/ijismd.315027

Choudhury, S. S., Mohanty, S. N., & Jagadev, A. K. (2021). Multimodal trust based recommender system with machine learning approaches for movie recommendation. International Journal of Information Technology, 13, 475-482. DOI: https://doi.org/10.1007/s41870-020-00553-2




How to Cite

Nayak SK, Garanayak M, Swain SK. A predictive prototype for the identification of diseases relied on the symptoms described by patients. EAI Endorsed Trans Perv Health Tech [Internet]. 2024 Mar. 13 [cited 2024 Apr. 25];10. Available from: https://publications.eai.eu/index.php/phat/article/view/5405