Multi-class Classification of Imbalanced Intelligent Data using Deep Neural Network
DOI: https://doi.org/10.4108/airo.3486
Keywords: Imbalanced Data, Dynamic Sampling, Deep Neural Network
Abstract
Deep learning research has made significant progress in recent years. Most of this work has focused on
datasets with balanced class distributions, and far less on imbalanced datasets, which are common in
real-world applications and pose significant challenges for classification. This article studies the
classification of imbalanced multiclass data and proposes a dynamic sampling method for training deep
neural networks. In the proposed method, all samples are fed to the current deep neural network in each
training iteration, and the network's accuracy, precision, and mean error are estimated. The method then
dynamically selects informative samples for training the network. Comprehensive experiments were
conducted to evaluate the method and to understand its strengths and weaknesses. Results on 13 imbalanced
multiclass datasets show that the proposed method outperforms other methods, such as initial sampling
techniques, active learning, cost-sensitive learning, and reinforcement learning.
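
The abstract does not spell out the selection rule, so the following is only a minimal sketch of the general idea of dynamic sampling, not the authors' implementation. It assumes a small softmax classifier as a stand-in for the deep neural network, and a hypothetical rule that keeps each sample with probability 1 - p(true class), so that poorly fit samples (often from minority classes) are chosen more often. The function names, the toy data, and the hyperparameters are all illustrative assumptions.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)  # stabilise the exponentials
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def select_informative(probs, y, rng):
    """Hypothetical rule (an assumption, not taken from the article):
    keep a sample with probability 1 - p(true class)."""
    p_true = probs[np.arange(len(y)), y]
    keep = rng.random(len(y)) < (1.0 - p_true)
    return np.flatnonzero(keep)

def train_dynamic_sampling(X, y, n_classes, epochs=300, lr=0.1, seed=0):
    rng = np.random.default_rng(seed)
    W = np.zeros((X.shape[1], n_classes))
    b = np.zeros(n_classes)
    onehot = np.eye(n_classes)[y]
    for _ in range(epochs):
        # Feed ALL samples to the current model, as the abstract describes.
        probs = softmax(X @ W + b)
        # Dynamically choose the samples used for this update.
        idx = select_informative(probs, y, rng)
        if idx.size == 0:
            continue  # nothing was selected in this iteration
        # Cross-entropy gradient with respect to the logits.
        grad = probs[idx] - onehot[idx]
        W -= lr * X[idx].T @ grad / idx.size
        b -= lr * grad.mean(axis=0)
    return W, b

def per_class_recall(W, b, X, y, n_classes):
    pred = np.argmax(X @ W + b, axis=1)
    return np.array([(pred[y == c] == c).mean() for c in range(n_classes)])

# Toy imbalanced three-class problem (synthetic, for illustration only).
data_rng = np.random.default_rng(42)
centers = np.array([[0.0, 0.0], [4.0, 0.0], [0.0, 4.0]])
sizes = [300, 30, 10]
X = np.vstack([data_rng.normal(c, 0.5, size=(n, 2))
               for c, n in zip(centers, sizes)])
y = np.repeat(np.arange(3), sizes)

W, b = train_dynamic_sampling(X, y, n_classes=3)
recall = per_class_recall(W, b, X, y, n_classes=3)
print("per-class recall:", recall)
```

Per-class recall is reported here because overall accuracy can look high on imbalanced data even when minority classes are misclassified; the article itself mentions accuracy, precision, and mean error as the monitored quantities.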
Copyright (c) 2023 Masoumeh Soleimani, Akram Sadat Mirshahzadeh
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.