Predicting Diabetes Disease for healthy smart cities




Data Mining, Diabetes, CRISP-DM, Classification, ML Models, Smart Cities, Smart Health


INTRODUCTION: Diabetes is a chronic condition that affects a large portion of the population and is the leading cause of numerous health problems. Its automatic detection could improve the communities’ overall well-being.
OBJECTIVES: The primary goal was to introduce advancements to the subject of healthy smart cities by studying an approach for predicting the occurrence of diabetes in the Pima Female Adult Population using data mining.
METHODS: This study uses CRISP-DM to analyze the results of six different models acquired from three different iterations of the same dataset.
DISCUSSION: This study found that the most promising model is k-NN, which obtained results of almost 92% of F1 Score with the third data preparation strategy.
CONCLUSION: Acceptable results were achieved with the k-NN model and the third data preparation strategy, but more research into improving the data preparation processes and their influence on the outputs of each model is needed.


Download data is not yet available.




How to Cite

H. Peixoto, V. Ramos, . C. Marques, and J. Machado, “Predicting Diabetes Disease for healthy smart cities”, EAI Endorsed Trans Smart Cities, vol. 6, no. 18, p. e1, Apr. 2022.

Funding data