Predicting Diabetes Disease for healthy smart cities
Keywords:Data Mining, Diabetes, CRISP-DM, Classification, ML Models, Smart Cities, Smart Health
INTRODUCTION: Diabetes is a chronic condition that affects a large portion of the population and is the leading cause of numerous health problems. Its automatic detection could improve the communities’ overall well-being.
OBJECTIVES: The primary goal was to introduce advancements to the subject of healthy smart cities by studying an approach for predicting the occurrence of diabetes in the Pima Female Adult Population using data mining.
METHODS: This study uses CRISP-DM to analyze the results of six different models acquired from three different iterations of the same dataset.
DISCUSSION: This study found that the most promising model is k-NN, which obtained results of almost 92% of F1 Score with the third data preparation strategy.
CONCLUSION: Acceptable results were achieved with the k-NN model and the third data preparation strategy, but more research into improving the data preparation processes and their influence on the outputs of each model is needed.
How to Cite
Copyright (c) 2022 EAI Endorsed Transactions on Smart Cities
This work is licensed under a Creative Commons Attribution 3.0 Unported License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.
Fundação para a Ciência e a Tecnologia
Grant numbers UIDB/00319/2020