Imbalanced Multiclass Medical Data Classification based on Learning Automata and Neural Network

Authors

DOI:

https://doi.org/10.4108/airo.3526

Keywords:

Classification, Imbalanced Data, Neural Network, Learning Automata

Abstract

Data classification in the real world is often faced with the challenge of data imbalance, where there is a
significant difference in the number of instances among different classes. Dealing with imbalanced data is
recognized as a challenging problem in data mining, as it involves identifying minority-class data with a
high number of errors. Therefore, the selection of unique and appropriate features for classifying data with
smaller classes poses a fundamental challenge in this research. Nowadays, due to the widespread presence
of imbalanced medical data in many real-world problems, the processing of such data has gained attention
from researchers. The objective of this research is to propose a method for classifying imbalanced medical
data. In this paper, the hypothyroidism dataset from the UCI repository is used. In the feature selection stage,
a support vector machine algorithm is used as a cost function, and the wrapper algorithm is employed as
a search strategy to achieve an optimal subset of features. The proposed method achieves high accuracy,
reaching 99.6% accuracy for data classification through the optimization of a neural network using learning
automata.

Downloads

Download data is not yet available.

References

Harouni M, Baghmaleki HY. Color image segmentation metrics. Encyclopedia of Image Processing. 2018 Nov 8;95:10-21.

Moshayedi AJ, Khan AS, Shuxin Y, Kuan G, Jiandong H, Soleimani M, Razi A. E-Nose design and structures from statistical analysis to application in robotic: a compressive review. EAI Endorsed Transactions on AI and Robotics. 2023 Apr 20;2(1):e1.

Mahmudi F, Soleimani M. Some results on Maximal Graph of a Commutative Ring, 2016.

Hoorfar H, Bagheri A. Geometric Embedding of Path and Cycle Graphs in Pseudo-convex Polygons. arXiv preprint arXiv:1708.01457. 2017 Aug 4.

Jia, F., Lei, Y., Lu, N., Xing, S. (2018). Deep normalized convolutional neural network for imbalanced fault classification of machinery and its understanding via visualization. Mechanical Systems and Signal Processing, 110, 349-367.

Harouni M, Karimi M, Rafieipour S. Precise segmentation techniques in various medical images. Artificial Intelli-gence and Internet of Things. 2021 Aug 25:117-66.

Karimi M, Harouni M, Rafieipour S. Automated medical image analysis in digital mammography. InArtificial Intelligence and Internet of Things 2021 Aug 25 (pp. 85-116). CRC Press.

García, S., Zhang, Z. L., Altalhi, A., Alshomrani, S., Herrera, F. (2018). Dynamic ensemble selection for multi-class imbalanced datasets. Information Sciences, 445, 22-37.

Verma AK, Pal S, Kumar S (2019) Classification of skin disease using ensemble data mining techniques. Asian Pac J Cancer Prev 20(6):1887–1894.

Ahmad W, Huang L, Ahmad A, Shah F, Iqbal A, Saeed A (2017) Thyroid diseases forecasting using a hybrid decision support system based on ANFIS, k-NN and information gain method.J Appl Environ Biol Sci 7(10):78–85.

Khan AR, Doosti F, Karimi M, Harouni M, Tariq U, Fati SM, Ali Bahaj S. Authentication through gender classification from iris images using support vector machine. Microscopy research and technique. 2021 Nov;84(11):2666-76.

Soleimani M, Mahmudi F, Naderi MH. On the Maximal Graph of a Commutative Ring. Mathematics Interdisci-plinary Research. 2021 Jul 2.

Chaurasia V, Pal S, Tiwari BB (2018) Prediction of benign and malignant breast cancer using data mining techniques. J Algorithm Comput Technol 12(2):119–126.

Moshayedi, A. J., Gharpure, D. C. (2012, May). Development of position monitoring system for studying performance of wind tracking algorithms. In ROBOTIK 2012; 7th German Conference on Robotics (pp. 1-4). VDE.

Sumathi A, Nithya G, Meganathan S (2018) Classifica-tion of thyroid disease using data mining techniques. Int J Pure Appl Math 119(12):13881–13890

Moshayedi, A. J., Li, J., Sina, N., Chen, X., Liao, L., Gheisari, M., X. Xie,(2022). Simulation and validation of optimized pid controller in agv (automated guided vehi-cles) model using pso and bas algorithms. Computational Intelligence and Neuroscience, 2022.

Pravin SR, Jafar OA (2017) Prediction of skin disease using data mining techniques. IJARCCE 6(7):313–318.

. Wang J, Li S, Song W, Qin H, Zhang B, Hao A (2018) Learning from weakly-labeled clinical data for automatic thyroid nodule classification in ultrasound images. In: 2018 25th IEEE international conference on image processing (ICIP), IEEE, pp 3114–3118.

Sivasakthivel A, Shrivakshan GT (2017) A compara-tive study of diagnosing thyroid diseases using classifi-cation algorithm. Int JAdv Res Comput Sci Softw Eng 7(8):181–184.

Moshayedi A. J., Gharpure, D. C. (2017, May). Evaluation of bio inspired Mokhtar: Odor localization system. In 2017 18th international carpathian control conference (ICCC) (pp. 527-532). IEEE.

IoniN I., IoniNL. (2016) Prediction of thyroid disease using data mining techniques. Brain Broad Res Artif Intell Neurosci 7(3):115–127.

Rathi M, Pareek V (2016) Disease prediction tool: an integrated hybrid data mining approach for healthcare. IRACST Int J Comput Sci Inf Technol Secur (IJCSITS) 6(6):32–40

K. Geetha, Baboo CSS (2016) Efficient thyroid disease classification using differential evolution with SVM. J Theoret Appl Inf Technol 88(3):410–420.

Soleimani M, Mirshahzadeh AS. Multi-class Classifica-tion of Imbalanced Intelligent Data using Deep Neural Network. EAI Endorsed Transactions on AI and Robotics. 2023 Jul 12;2.

Moshayedi, A. J., Chen, Z. Y., Liao, L., Li, S. (2022). Sunfa Ata Zuyan machine learning models for moon phase detection: algorithm, prototype and performance com-parison. TELKOMNIKA (Telecommunication Computing Electronics and Control), 20(1), 129-140.

Razia S, Narasingarao MR, Sridhar GR (2015) Decision support system for prediction of thyroid disease—a comparison of multilayer perceptron neural network and radial function neural network. J Theoret Appl Inf Technol 80(3):544–551.

Gaikwad S, Pise N (2014) An experimental study on hypothyroid using rotation forest. Int J Data Min Knowl Manag Process (IJDKP) 4(6):31. https://doi.org/10.5121/ijdkp.2014.4603.

Hoorfar H, Bagheri A. Minimum hidden guarding of histogram polygons. arXiv preprint arXiv:1708.05815. 2017 Aug 19.

Soleimani M, Naderi MH, Ashrafi AR. Tensor product of the power graph of some finite rings. Facta Universitatis, Series: Mathematics and Informatics. 2019 Mar 13:101-22.

Soleimani M, Mahmudi F, Naderi MH. Some results on the maximal graph of commutative rings. Advanced Studies: Euro-Tbilisi Mathematical Journal. 2023 Mar;16(supp1):21-6.

Mahmudi F, Soleimani M, Naderi MH. Some Properties of the Maximal Graph of a Commutative Ring. Southeast Asian Bulletin of Mathematics. 2019 Jul 1;43(4).

Hoorfar H, Bagheri A. A New Optimal Algorithm for Computing the Visibility Area of a simple Polygon from a Viewpoint. arXiv preprint arXiv:1803.10184. 2018 Mar 27.

Moshayedi A. J., Hosseini M. S. , F. Rezaee. (2019). WiFi based massager device with NodeMCU through arduino interpreter. Journal of Simulation and Analysis of Novel Technologies in Mechanical Engineering, 11(1), 73-79.

Downloads

Published

24-07-2023

How to Cite

[1]
M. Soleimani, Z. Forouzanfar, M. Soltani, and M. Jafari Harandi, “Imbalanced Multiclass Medical Data Classification based on Learning Automata and Neural Network”, EAI Endorsed Trans AI Robotics, vol. 2, Jul. 2023.