A Review of Image Classification Algorithms in IoT

Authors

DOI:

https://doi.org/10.4108/eetiot.v7i28.562

Keywords:

IOT, Convolutional Neural Network, Image Classification, Deep Learning

Abstract

With the advent of big data era and the enhancement of computing power, Deep Learning has swept the world. Based on Convolutional Neural Network (CNN) image classification technique broke the restriction of classical image classification methods, becoming the dominant algorithm of image classification. How to use CNN for image classification has turned into a hot spot. After systematically studying convolutional neural network and in-depth research of the application of CNN in computer vision, this research briefly introduces the mainstream structural models, strengths and shortcomings, time/space complexity, challenges that may be suffered during model training and associated solutions for image classification. This research also compares and analyzes the differences between different methods and their performance on commonly used data sets. Finally, the shortcomings of Deep Learning methods in image classification and possible future research directions are discussed.

Downloads

Download data is not yet available.
<br data-mce-bogus="1"> <br data-mce-bogus="1">

References

S. Bhattacharyya, "A Brief Survey of Color Image Preprocessing and Segmentation Techniques," Journal of Pattern Recognition Research, vol. 1, no. 1, pp. 120-129, 2011. DOI: https://doi.org/10.13176/11.191

M. A. Vega-Rodriguez, Review: Feature Extraction and Image Processing. Feature extraction and image processing, 2002.

D. Zhang, B. Liu, C. Sun, and X. Wang, "Learning the Classifier Combination for Image Classification," Journal of Computers, vol. 6, no. 8, pp. 1756-1763, 2011. DOI: https://doi.org/10.4304/jcp.6.8.1756-1763

M. Grabner, H. Grabner, and H. Bischof, "Fast approximated SIFT," in Asian Conference on Computer Vision, 2006. DOI: https://doi.org/10.1007/11612032_92

L. He, C. Zou, L. Zhao, and D. Hu, "An Enhanced LBP Feature Based on Facial Expression Recognition," in IEEE Engineering in Medicine & Biology Conference, 2005, pp. 3300-3303.

T. T. Do and E. Kijak, "FACE RECOGNITION USING CO-OCCURRENCE HISTOGRAMS OF ORIENTED GRADIENTS," in IEEE International Conference on Acoustics, 2012.

G. Amato and F. Falchi, "Local Feature based Image Similarity Functions for kNN Classification," in ICAART 2011 - Proceedings of the 3rd International Conference on Agents and Artificial Intelligence, Volume 1 - Artificial Intelligence, Rome, Italy, January 28-30, 2011, 2011.

T. Joachims, "Making Large-Scale SVM Learning Practical," Technical Reports, vol. 8, no. 3, pp. 499-526, 1998.

L. Deng and D. Yu, "Deep Learning: Methods and Applications," Foundations & Trends in Signal Processing, vol. 7, no. 3, pp. 197-387, 2014. DOI: https://doi.org/10.1561/2000000039

Q. V. Le, J. Ngiam, A. Coates, A. Lahiri, and A. Y. Ng, "On Optimization Methods for Deep Learning," in International Conference on Machine Learning, 2011.

Y. Lecun, Y. Bengio, and G. Hinton, "Deep learning," Nature, vol. 521, no. 7553, p. 436, 2015. DOI: https://doi.org/10.1038/nature14539

S. Bhardwaj, S. Tewari, and S. Jain, "Study on Future of Artificial Intelligence in Neural."

L. Zhou, S. Pan, J. Wang, and A. V. Vasilakos, "Machine Learning on Big Data: Opportunities and Challenges," Neurocomputing, vol. 237, no. MAY10, pp. 350-361, 2017. DOI: https://doi.org/10.1016/j.neucom.2017.01.026

S. C. Satapathy and D. Wu, "Improving ductal carcinoma in situ classification by convolutional neural network with exponential linear unit and rank-based weighted pooling," Complex Intell. Syst., vol. 7, pp. 1295-1310, 2020/11/22 2021. DOI: https://doi.org/10.1007/s40747-020-00218-4

I. Aliyu, K. J. Gana, A. A. Musa, M. A. Adegboye, and C. G. Lim, "Incorporating Recognition in Catfish Counting Algorithm Using Artificial Neural Network and Geometry," Ksii Transactions on Internet and Information Systems, vol. 14, no. 12, pp. 4866-4888, Dec 2020.

T. Vaidya and K. Chatterjee, "Enhancing the stability of active harmonic filter using artificial neural network-based current control scheme," IET Power Electronics, vol. 13, no. 19, pp. 4601-4609, Dec 2020. DOI: https://doi.org/10.1049/iet-pel.2020.0364

Y.-D. Zhang, "A five-layer deep convolutional neural network with stochastic pooling for chest CT-based COVID-19 diagnosis," Machine Vision and Applications, vol. 32, 2021, Art no. 14. DOI: https://doi.org/10.1007/s00138-020-01128-8

D. R. Nayak, "Detection of unilateral hearing loss by Stationary Wavelet Entropy," CNS & Neurological Disorders - Drug Targets, vol. 16, no. 2, pp. 15-24, 2017. DOI: https://doi.org/10.2174/1871527315666161024142036

Y. Jiang, "Exploring a smart pathological brain detection method on pseudo Zernike moment," Multimedia Tools and Applications, vol. 77, no. 17, pp. 22589-22604, 2018. DOI: https://doi.org/10.1007/s11042-017-4703-0

J. Li, "Detection of Left-Sided and Right-Sided Hearing Loss via Fractional Fourier Transform," Entropy, vol. 18, no. 5, 2016, Art no. 194. DOI: https://doi.org/10.3390/e18050194

A. Krizhevsky, I. Sutskever, and G. Hinton, "ImageNet Classification with Deep Convolutional Neural Networks," Advances in neural information processing systems, vol. 25, no. 2, 2012.

M. Oquab, L. Bottou, I. Laptev, and J. Sivic, "learning and transferring mid-level image representions using convolutional neural networks," 2017.

S. Zagoruyko and N. Komodakis, "Learning to Compare Image Patches via Convolutional Neural Networks," in 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015. DOI: https://doi.org/10.1109/CVPR.2015.7299064

P. Prakash, A. Banerjee, and P. K. Perepu, "Determination of the relative inclination and the viewing angle of an interacting pair of galaxies using Convolutional Neural Networks," Monthly Notices of the Royal Astronomical Society, vol. 497, no. 3, pp. 3323-3334, Sep 2020. DOI: https://doi.org/10.1093/mnras/staa2109

Y. D. Zhang, "A seven-layer convolutional neural network for chest CT based COVID-19 diagnosis using stochastic pooling," IEEE Sens. J., pp. 1-1. doi: 10.1109/JSEN.2020.3025855 DOI: https://doi.org/10.1109/JSEN.2020.3025855

D. S. Guttery, "Improved Breast Cancer Classification Through Combining Graph Convolutional Network and Convolutional Neural Network," Information Processing and Management, vol. 58, 2, 2021, Art no. 102439. DOI: https://doi.org/10.1016/j.ipm.2020.102439

W. Zhu, "ANC: Attention Network for COVID-19 Explainable Diagnosis Based on Convolutional Block Attention Module," Computer Modeling in Engineering & Sciences, vol. 127, 3, pp. 1037-1058, 2021. DOI: https://doi.org/10.32604/cmes.2021.015807

H. Uzen and K. Hanbay, "LM Filter-Based Deep Convolutional Neural Network for Pedestrian Attribute Recognition," Journal of Polytechnic-Politeknik Dergisi, vol. 23, no. 3, pp. 605-613, Sep 2020.

S. Vagios et al., "THE USE OF DEEP CONVOLUTIONAL NEURAL NETWORKS (CNN) TO OBJECTIVELY ASSESS THE ROLE OF ABSTINENCE ON IVF DEVELOPMENTAL OUTCOMES," Fertility and Sterility, vol. 114, no. 3, pp. E141-E141, Sep 2020. DOI: https://doi.org/10.1016/j.fertnstert.2020.08.416

E. Villain et al., "Convolutional neural network for discriminating between Multiple System Atrophy and Healthy Control, comparing MRI modalities and highlighting the disease signature," Movement Disorders, vol. 35, pp. S121-S122, Sep 2020.

Q. Abbas, "Object Recognition using Template Matching and Pre-trained convolutional neural network," International Journal of Computer Science and Network Security, vol. 20, no. 8, pp. 69-79, Aug 2020.

Q. Zhou, "ADVIAN: Alzheimer's Disease VGG-Inspired Attention Network Based on Convolutional Block Attention Module and Multiple Way Data Augmentation," Front. Aging Neurosci., vol. 13, 2021, Art no. 687456. DOI: https://doi.org/10.3389/fnagi.2021.687456

Z. Zhang and X. Zhang, "MIDCAN: A multiple input deep convolutional attention network for Covid-19 diagnosis based on chest CT and chest X-ray," Pattern Recognition Letters, vol. 150, pp. 8-16, 2021. DOI: https://doi.org/10.1016/j.patrec.2021.06.021

Y.-D. Zhang, D. R. Nayak, X. Zhang, and S.-H. Wang, "Diagnosis of secondary pulmonary tuberculosis by an eight-layer improved convolutional neural network with stochastic pooling and hyperparameter optimization," Journal of Ambient Intelligence and Humanized Computing, Accessed on: 2020/10/23. doi: 10.1007/s12652-020-02612-9 [Online]. Available: https://doi.org/10.1007/s12652-020-02612-9 DOI: https://doi.org/10.1007/s12652-020-02612-9

J. Jo, H. I. Koo, J. W. Soh, and N. I. Cho, "Handwritten Text Segmentation via End-to-End Learning of Convolutional Neural Networks," Multimedia Tools and Applications, vol. 79, no. 43-44, pp. 32137-32150, Nov 2020. DOI: https://doi.org/10.1007/s11042-020-09624-9

S. K. Ghosh, B. Biswas, and A. Ghosh, "Development of intuitionistic fuzzy special embedded convolutional neural network for mammography enhancement," Comput. Intell., vol. 37, no. 1, pp. 47-69, Feb 2021. DOI: https://doi.org/10.1111/coin.12391

M. C. Kim and J. H. Lee, "Color reproduction in virtual lip makeup using a convolutional neural network," Color Research and Application, vol. 45, no. 6, pp. 1190-1201, Dec 2020. DOI: https://doi.org/10.1002/col.22549

A. Giusti, D. C. Cirean, J. Masci, L. M. Gambardella, and J. Schmidhuber, "Fast Image Scanning with Deep Max-Pooling Convolutional Neural Networks," IEEE, 2013. DOI: https://doi.org/10.1109/ICIP.2013.6738831

K. Wu, "SOSPCNN: Structurally Optimized Stochastic Pooling Convolutional Neural Network for Tetralogy of Fallot Recognition," Wireless Communications and Mobile Computing, vol. 2021, p. 5792975, 2021/07/02 2021, Art no. 5792975.

V. Govindaraj, "Deep Rank-Based Average Pooling Network for Covid-19 Recognition," Computers, Materials & Continua, vol. 70, 2, pp. 2797-2813, 2022. DOI: https://doi.org/10.32604/cmc.2022.020140

X. Cheng, "PSSPNN: PatchShuffle Stochastic Pooling Neural Network for an Explainable Diagnosis of COVID-19 with Multiple-Way Data Augmentation," Computational and Mathematical Methods in Medicine, vol. 2021, 2021, Art no. 6633755. DOI: https://doi.org/10.1155/2021/6633755

H. Shirai et al., "Estimation of the Number of Convallaria Keiskei's Colonies UsingUAVImages Based on a Convolutional Neural Network," IEEJ Transactions on Electrical and Electronic Engineering, vol. 15, no. 10, pp. 1552-1554, Oct 2020. DOI: https://doi.org/10.1002/tee.23226

R. Abed, S. Bahroun, and E. Zagrouba, "KeyFrame extraction based on face quality measurement and convolutional neural network for efficient face recognition in videos," Multimedia Tools and Applications, vol. 80, no. 15, pp. 23157-23179, Jun 2021. DOI: https://doi.org/10.1007/s11042-020-09385-5

T. Ameen, L. Chen, Z. X. Xu, D. D. Lyu, and H. Y. Shi, "A Convolutional Neural Network and Matrix Factorization-Based Travel Location Recommendation Method Using Community-Contributed Geotagged Photos," Isprs International Journal of Geo-Information, vol. 9, no. 8, Aug 2020, Art no. 464. DOI: https://doi.org/10.3390/ijgi9080464

D. Anderson, "Deep fractional max pooling neural network for COVID-19 recognition," Frontiers in Public Health, vol. 9, 2021, Art no. 726144. DOI: https://doi.org/10.3389/fpubh.2021.726144

M. León, A. Moreno-Báez, R. Magallanes-Quintanar, and R. D. Valdez-Cepeda, "Assessment in subsets of MNIST Handwritten Digits and their Effect in the Recognition Rate," Journal of Pattern Recognition Research, vol. 2, no. 2, 2011. DOI: https://doi.org/10.13176/11.348

H. Li, H. Liu, X. Ji, G. Li, and L. Shi, "CIFAR10-DVS: An Event-Stream Dataset for Object Classification," Frontiers in Neuroscience, vol. 11, 2017. DOI: https://doi.org/10.3389/fnins.2017.00309

D. Jia, D. Wei, R. Socher, L. J. Li, L. Kai, and F. F. Li, "ImageNet: A large-scale hierarchical image database," 2009, pp. 248-255.

K. He, X. Zhang, S. Ren, and J. Sun, "Deep Residual Learning for Image Recognition," IEEE, 2016. DOI: https://doi.org/10.1109/CVPR.2016.90

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, and A. Rabinovich, "Going Deeper with Convolutions," IEEE Computer Society, 2014. DOI: https://doi.org/10.1109/CVPR.2015.7298594

K. Simonyan and A. Zisserman, "Very Deep Convolutional Networks for Large-Scale Image Recognition," arXiv, 2014.

M. Jaderberg, A. Vedaldi, and A. Zisserman, "Speeding up Convolutional Neural Networks with Low Rank Expansions," Computer ence, vol. 4, no. 4, p. XIII, 2014. DOI: https://doi.org/10.5244/C.28.88

X. Liu, J. Pool, S. Han, and W. J. Dally, "Efficient Sparse-Winograd Convolutional Neural Networks," 2018.

S. Srinivas and R. V. Babu, "Data-free parameter pruning for Deep Neural Networks," Computer Science, pp. 2830-2838, 2015. DOI: https://doi.org/10.5244/C.29.31

X. Qi, T. Wang, and J. Liu, "Comparison of Support Vector Machine and Softmax Classifiers in Computer Vision," in International Conference on Mechanical, 2018, pp. 151-155. DOI: https://doi.org/10.1109/ICMCCE.2017.49

S. C. Satapathy, "Fruit category classification by fractional Fourier entropy with rotation angle vector grid and stacked sparse autoencoder," Expert Syst., vol. 39, no. 3, 2022, Art no. e12701. DOI: https://doi.org/10.1111/exsy.12701

Z. Zhu, "PSCNN: PatchShuffle Convolutional Neural Network for COVID-19 Explainable Diagnosis," Frontiers in Public Health, vol. 9, 2021, Art no. 768278. DOI: https://doi.org/10.3389/fpubh.2021.768278

C. Tang, "Twelve-layer deep convolutional neural network with stochastic pooling for tea category classification on GPU platform," Multimedia Tools and Applications, vol. 77, no. 17, pp. 22821-22839, 2018. DOI: https://doi.org/10.1007/s11042-018-5765-3

S. Purwar, R. Tripathi, A. W. Barwad, and A. K. Dinda, "Detection of Mesangial hypercellularity of MEST-C score in immunoglobulin A-nephropathy using deep convolutional neural network," Multimedia Tools and Applications, vol. 79, no. 37-38, pp. 27683-27703, Oct 2020. DOI: https://doi.org/10.1007/s11042-020-09304-8

M. L. George, T. Govindarajan, K. A. Rajasekaran, and S. R. Bandi, "A robust similarity based deep siamese convolutional neural network for gait recognition across views," Comput. Intell., vol. 36, no. 3, pp. 1290-1319, Aug 2020. DOI: https://doi.org/10.1111/coin.12361

A. B. Jiang and W. W. Wang, "Research on optimization of ReLU activation function," Transducer and Microsystem Technologies, 2018.

Z. W. Yuan and J. Zhang, "Feature extraction and image retrieval based on AlexNet," in Eighth International Conference on Digital Image Processing (ICDIP 2016), 2016. DOI: https://doi.org/10.1117/12.2243849

Y. Gal and Z. Ghahramani, "Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning," JMLR.org, 2015.

L. Perez and J. Wang, "The Effectiveness of Data Augmentation in Image Classification using Deep Learning," 2017.

R. K. Pandey and R. A. Ganesan, "DeepInterpolation: fusion of multiple interpolations and CNN to obtain super-resolution," IET Image Processing, 2021. DOI: https://doi.org/10.1049/iet-ipr.2019.1244

L. Wang, Y. Xiong, Z. Wang, and Y. Qiao, "Towards Good Practices for Very Deep Two-Stream ConvNets," Computer Science, 2015.

R. K. Srivastava, K. Greff, and J. Schmidhuber, "Highway Networks," Computer Science, 2015.

A. Bordes, L. Bottou, and P. Gallinari, "SGD-QN: Careful Quasi-Newton Stochastic Gradient Descent," Journal of Machine Learning Research, vol. 10, no. 3, pp. 1737-1754, 2009.

C. Szegedy, S. Ioffe, V. Vanhoucke, and A. Alemi, "Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning," 2016.

J. Schmidt-Hieber, "Nonparametric regression using deep neural networks with ReLU activation function," 2017.

H. Jie, S. Li, S. Gang, and S. Albanie, "Squeeze-and-Excitation Networks," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. PP, no. 99, 2017.

A. G. Howard et al., "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications," 2017.

M. Cogswell, F. Ahmed, R. Girshick, L. Zitnick, and D. Batra, "Reducing Overfitting in Deep Networks by Decorrelating Representations," Computer Science, 2015.

S. Yeom, I. Giacomelli, M. Fredrikson, and S. Jha, "Privacy Risk in Machine Learning: Analyzing the Connection to Overfitting," in IEEE Computer Security Foundations Symposium, 2017. DOI: https://doi.org/10.1109/CSF.2018.00027

L. Lin, N. J. Higham, and J. Pan, "Covariance structure regularization via entropy loss function," Computational Statistics & Data Analysis, vol. 72, no. 3, pp. 315-327, 2014. DOI: https://doi.org/10.1016/j.csda.2013.10.004

S. Narayan and G. Tagliarini, "An analysis of underfitting in MLP networks," in IEEE International Joint Conference on Neural Networks, 2005.

S. Squartini, S. Paolinelli, and F. Piazza, "Comparing Different Recurrent Neural Architectures on a Specific Task from Vanishing Gradient Effect Perspective," in IEEE International Conference on Networking, 2006.

R. Pascanu, T. Mikolov, and Y. Bengio, "Understanding the exploding gradient problem," Arxiv Preprint Arxiv, 2012.

N. Gajhede, O. Beck, and H. Purwins, "Convolutional Neural Networks with Batch Normalization for Classifying Hi-hat, Snare, and Bass Percussion Sound Samples," ACM, pp. 111-115, 2016. DOI: https://doi.org/10.1145/2986416.2986453

M. Simon, E. Rodner, and J. Denzler, "ImageNet pre-trained models with batch normalization," 2016.

J. Kohler, H. Daneshmand, A. Lucchi, M. Zhou, K. Neymeyr, and T. Hofmann, "Towards a Theoretical Understanding of Batch Normalization," 2018.

Downloads

Published

21-04-2022

How to Cite

[1]
X. Zheng and R. S. Cloutier, “A Review of Image Classification Algorithms in IoT”, EAI Endorsed Trans IoT, vol. 7, no. 28, p. e4, Apr. 2022.