Deep Learning Application Pros And Cons Over Algorithm
Keywords:Deep learning, face recognition, speech recognition, medical image recognition, character recognition
Deep learning is a new area of machine learning research. Deep learning technology applies the nonlinear and advanced transformation of model abstraction into a large database. The latest development shows that deep learning in various fields and greatly contributed to artificial intelligence so far. This article reviews the contributions and new applications of deep learning. The main target of this review is to give the summarize points for scholars to have the analysis about applications and algorithms. Then review tries to investigate the main applications and uses algorithms. In addition, the advantages of using the method of deep learning and its hierarchical and nonlinear functioning are introduced and compared to traditional algorithms in common applications. The following three criteria should be taken into consideration when choosing the area of application. (1) expertise or knowledge of the author; (2) the successful application of deep learning technology has changed the field of application, such as voice recognition, chat robots, search technology and vision; and (3) deep learning can have a significant impact on the application domain and benefit from recent research with natural language and text processing, information recovery and multimodal information processing resulting from multitasking deep learning. This review provides a general overview of a new concept and the growing benefits and popularity of deep learning, which can help researchers and students interested in deep learning methods.
Liu W, Wen Y, Yu Z, Li M, B Raj, Song L, Sphereface: Deep hypersphere embedding for face recognition, in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017, 212–220.
Tran L, Yin X, Liu X, Disentangled representation learning gan for pose-invariant face recognition. in Proceedings of the IEEE conference on computer vision and pattern recognition. 2017; 1415–1424.
Yang J, et al., Neural aggregation network for video face recognition. in Proceedings of the IEEE conference on computer vision and pattern recognition, 2017; 4362–4371.
Ding H, Zhou SK, Chellappa R, Facenet2expnet: Regularizing a deep face recognition net for expression recognition. in 2017 12th IEEE International Conference on Automatic Face & Gesture Recognition. 2017; 118–126.
Ding C, Tao D, Trunk-branch ensemble convolutional neural networks for video-based face recognition,” IEEE Trans. Pattern Anal. Mach. Intell., 2017; 40(4):1002–1014.
A Nech, I Kemelmacher-Shlizerman, Level playing field for million scale face recognition. in Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, 2017; 2017-Janua:3406–3415, DOI: 10.1109/CVPR.2017.363.
Wang H, et al., Cosface: Large margin cosine loss for deep face recognition. in Proceedings of the IEEE conference on computer vision and pattern recognition, 2018, 5265–5274.
Mittal S, Agarwal S, Nigam MJ, Real time multiple face recognition: A deep learning approach. in ACM International Conference Proceeding Series, 2018, 70–76, DOI: 10.1145/3299852.3299853.
Cheng WC, Wu TY, Li DW, Ensemble convolutional neural networks for face recognition. ACM Int. Conf. Proceeding Ser. 2018; 40(4):1002–1014. DOI: 10.1145/3302425.3302459.
Deng J, Guo J, Xue N, Zafeiriou S, Arcface: Additive angular margin loss for deep face recognition. in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019, 4690–4699.
Rabiner LR, Applications of speech recognition in the area of telecommunications. in 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings, 1997, 501–510.
Deng L, Huang X, Challenges in adopting speech recognition. Commun. ACM, 2004; 47(1) 69–75.
Quoc CN, Tien DT, Dang KN, Huu BN, Robust speech recognition based on binaural speech enhancement system as a preprocessing step. ACM Int. Conf. Proceeding Ser., 2012; 91–96. DOI: 10.1145/2350716.2350732.
Miao Y, Metze F, Rawat S, Deep maxout networks for low-resource speech recognition. in 2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 2013, 398–403.
Xi X, Jingqian W, Improved lattice-based speech keyword spotting algorithm. J. Tsinghua Univ. Science Technol. 2015; 55(5):508–513.
Noda K, Yamaguchi Y, Nakadai K, Okuno HG, Ogata T, Audio-visual speech recognition using deep learning. Appl. Intell. 2015; 42(4):722–737.
Chorowski J, Bahdanau D, Serdyuk D, Cho K, Bengio Y, Attention-based models for speech recognition arXiv Prepr. arXiv1506.07503, 2015.
Ravanelli M, Brakel P, Omologo M, Bengio Y, Light gated recurrent units for speech recognition. IEEE Trans. Emerg. Top. Comput. Intell. 2018; 2(2):92–102.
Image Recognition : A Complete Guide,” Deepomatic. Jan. 2019, Accessed: Sep. 25, 2021. https://deepomatic.com/en/what-is-image-recognition.
Pak M, Kim S, A review of deep learning in image recognition. in 2017 4th International Conference on Computer Applications and Information Processing Technology (CAIPT), 2017; 1–3, DOI: 10.1109/CAIPT.2017.8320684.
M. Wu and L. Chen, “Image recognition based on deep learning,” in 2015 Chinese Automation Congress (CAC), 2015, pp. 542–546, DOI: 10.1109/CAC.2015.7382560.
Miao Y, Metze F, Improving low-resource CD-DNN-HMM using dropout and multilingual DNN training. in Interspeech, 2013; 13:2237–2241.
Shen J, Shafiq MO, Deep Learning Convolutional Neural Networks with Dropout - A Parallel Approach,” in 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), 2018; 572–577, DOI: 10.1109/ICMLA.2018.00092.
Maalej R, Tagougui N, Kherallah M, Online Arabic Handwriting Recognition with Dropout Applied in Deep Recurrent Neural Networks. in 2016 12th IAPR Workshop on Document Analysis Systems (DAS), 2016; 417–421, DOI: 10.1109/DAS.2016.49.
Pham V, Bluche T, Kermorvant C, Louradour J, Dropout Improves Recurrent Neural Networks for Handwriting Recognition. in 2014 14th International Conference on Frontiers in Handwriting Recognition, 2014; 285–290, DOI: 10.1109/ICFHR.2014.55.
Sun L, Su T, Liu C, Wang R, Deep LSTM Networks for Online Chinese Handwriting Recognition. in 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR). 2016; 271–276, DOI: 10.1109/ICFHR.2016.0059.
Elleuch M, Maalej R, Kherallah M, A New Design Based-SVM of the CNN Classifier Architecture with Dropout for Offline Arabic Handwritten Recognition. Procedia Comput. Sci., 2016; 80:1712–1723. DOI: https://doi.org/10.1016/j.procs.2016.05.512.
Yang HM, Zhang XY, Yin F, Luo Z, Liu CL, Unsupervised Adaptation of Neural Networks for Chinese Handwriting Recognition. in 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR), 2016, pp. 512–517, DOI: 10.1109/ICFHR.2016.0100.
Islam MT, Karim Siddique BMN, Rahman S, Jabid T, “Image Recognition with Deep Learning,” in 2018 International Conference on Intelligent Informatics and Biomedical Sciences (ICIIBMS), 2018; 3:106–110, doi: 10.1109/ICIIBMS.2018.8550021.
Botelho B, virtual assistant (AI assistant). techtarget, 2017. https://searchcustomerexperience.techtarget.com/definition/virtual-assistant-AI-assistant (accessed Aug. 21, 2021).
Duermyer R, Defenition and examples of virtual assistant: The balance small business. 2021. https://www.thebalancesmb.com/virtual-assistant-1794441 (accessed Aug. 21, 2021).
Page LC, Gehlbach H, How an Artificially Intelligent Virtual Assistant Helps Students Navigate the Road to College,” AERA Open. 2017; 3(4):2332858417749220. Oct., doi: 10.1177/2332858417749220.
Kepuska V, Bohouta G, Next-generation of virtual personal assistants (microsoft cortana, apple siri, amazon alexa and google home),” in 2018 IEEE 8th annual computing and communication workshop and conference (CCWC), 2018; 99–103.
Iannizzotto G, Lo Bello L, Nucita A, Grasso GM, A vision and speech enabled, customizable, virtual assistant for smart environments. in 2018 11th International Conference on Human System Interaction (HSI), 2018; 50–56.
Someshwar D, Bhanushali D, Chaudhari V, Nadkarni S, Implementation of Virtual Assistant with Sign Language using Deep Learning and TensorFlow. in 2020 Second International Conference on Inventive Research in Computing Applications (ICIRCA), 2020; 595–600.
Vishnu R, Krishna Prakash N, Mobile Application-Based Virtual Assistant Using Deep Learning. in Soft Computing and Signal Processing, 2021; 609–617.
Carnier M, Albertti R, Gavidia L, Severeyn E, A La Cruz, ToraxIA: Virtual Assistant for Radiologists Based on Deep Learning from Chest X-Ray. in Artificial Intelligence, Computer and Software Engineering Advances, 2021; 49–63.
Shawar BA, Atwell E, Chatbots: are they really useful?. in Ldv forum, 2007, vol. 22, no. 1, pp. 29–49.
Parmar M, Microsoft Bing search is getting its own AI-powered assistant. https://www.windowslatest.com/, 2021. https://www.windowslatest.com/2021/05/31/microsoft-bing-search-is-getting-its-own-ai-powered-assistant/ (accessed Aug. 27, 2021).
DALE R, The return of the chatbots,” Nat. Lang. Eng., vol. 2016; 22(5):811–817. DOI: DOI: 10.1017/S1351324916000243.
Bhagwat VA, Deep Learning for Chatbots. 2018.
Weizenbaum J, ELIZA — a Computer Program for the Study of Natural Language Communication between Man and Machine. Commun. ACM, 1983; 26(1):23–28, DOI: 10.1145/357980.357991.
Benoit R, Making a Clever Intelligent Agent: The Theory behind the Implementation, vol. 3. 2009.
Wu W, Yan R, Deep chit-chat: Deep learning for chatbots. in Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2019, 1413–1414.
Yan R, Song Y, Wu H, Learning to respond with deep neural networks for retrieval-based human-computer conversation system. in Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, 2016, pp. 55–64.
Yan R, Zhao D, Coupled context modeling for deep chit-chat: towards conversations between human and computer. in Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2018; 2574–2583.
Yan R, Chitty-Chitty-Chat Bot’: Deep Learning for Conversational AI. in IJCAI, 2018; 18:5520–5526.
Esteva A, et al., A guide to deep learning in healthcare,” Nat. Med. 2019; 25(1):24–29. DOI: 10.1038/s41591-018-0316-z.
Ardabili S, Mosavi A, Várkonyi-Kóczy AR, Advances in machine learning modeling reviewing hybrid and ensemble methods. in International Conference on Global Research and Education, 2019; 215–227.
Martis RJ, Gurupur VP, Lin H, Islam A, Fernandes SL, Recent advances in big data analytics, internet of things and machine learning. Elsevier, 2018.
Zou J, Huss M, Abid A, Mohammadi P, Torkamani A, Telenti A, A primer on deep learning in genomics. Nat. Genet. 2019; 51(1):12–18, DOI: 10.1038/s41588-018-0295-5.
Miotto R, Wang F, Wang S, Jiang X, Dudley JT, Deep learning for healthcare: review, opportunities and challenges,” Brief. Bioinform. 2018; 19(6):1236–1246. DOI: 10.1093/bib/bbx044.
Liu S, Liu S, Cai W, Pujol S, Kikinis R, Feng D, Early diagnosis of Alzheimer’s disease with deep learning. in 2014 IEEE 11th international symposium on biomedical imaging (ISBI), 2014; 1015–1018.
Brosch T, Tam R, Initiative ADN, Manifold learning of brain MRIs by deep learning. in International Conference on Medical Image Computing and Computer-Assisted Intervention, 2013; 633–640.
Prasoon A, Petersen K, Igel C, Lauze F, Dam E, Nielsen M, Deep feature learning for knee cartilage segmentation using a triplanar convolutional neural network. in International conference on medical image computing and computer-assisted intervention, 2013; 246–253.
Yoo Y, Brosch T, Traboulsee A, Li DKB, Tam R, Deep learning of image features from unlabeled data for multiple sclerosis lesion segmentation in International workshop on machine learning in medical imaging, 2014; 117–124.
Cheng JZ, et al., Computer-aided diagnosis with deep learning architecture: applications to breast lesions in us images and pulmonary nodules in CT scans. Sci Rep 6: 24454. 2016.
Gulshan V, et al., Development and validation of a deep learning algorithm for detection of diabetic retinopathy in retinal fundus photographs. Jama. 2016; 316(22):2402–2410.
Esteva A, et al., Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017; 542(7639):115–118.
Cheng Y, Wang F, Zhang P, Hu J, Risk prediction with electronic health records: A deep learning approach. in Proceedings of the 2016 SIAM International Conference on Data Mining, 2016; 432–440.
Lipton ZC, Kale DC, Elkan C, Wetzel R, Learning to diagnose with LSTM recurrent neural networks. arXiv Prepr. arXiv1511.03677, 2015.
Pham T, Tran T, Phung D, Venkatesh S, Deepcare: A deep dynamic memory model for predictive medicine. in Pacific-Asia conference on knowledge discovery and data mining, 2016; 30–41.
Miotto R, Li L, Kidd BA, Dudley JT, Deep patient: an unsupervised representation to predict the future of patients from the electronic health records. Sci. Rep. 2016; 6(1):1–10.
Miotto R, Li L, Dudley JT, Deep learning to predict patient future diseases from the electronic health records. in European conference on information retrieval, 2016; 768–774.
Liang Z, Zhang G, Huang JX, Hu QV, Deep learning for healthcare decision making with EMRs. in 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2014; 556–559.
Tran T, Nguyen TD, Phung D, Venkatesh S, Learning vector representation of medical objects via EMR-driven nonnegative restricted Boltzmann machines (eNRBM). J. Biomed. Inform., 2015; 54:96–105.
Che Z, Kale D, Li W, Bahadori MT, Liu Y, Deep computational phenotyping. in Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2015; 507–516.
Lasko TA, Denny JC, Levy MA, Computational phenotype discovery using unsupervised feature learning over noisy, sparse, and irregular clinical data,” PLoS One. 2013; 8(6):66341.
Choi E, Bahadori MT, Schuetz A, Stewart WF, J Sun, Doctor ai: Predicting clinical events via recurrent neural networks. in Machine learning for healthcare conference, 2016; 301–318.
Nguyen P, Tran T, Wickramasinghe N, Venkatesh S, Deepr: a convolutional net for medical records. IEEE J. Biomed. Heal. informatics. 2016; 21(1):22–30.
Razavian N, Marcus J, Sontag D, Multi-task prediction of disease onsets from longitudinal laboratory tests. in Machine learning for healthcare conference, 2016; 73–100.
Dernoncourt F, Lee JY, Uzuner O, Szolovits P, De-identification of patient notes with recurrent neural networks. J. Am. Med. Informatics Assoc. 2017; 24(3):596–606.
Zhou J, Troyanskaya OG, Predicting effects of noncoding variants with deep learning–based sequence model. Nat. Methods, 2015; 12(10):931–934.
Kelley DR, Snoek J, Rinn JL, Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks,” Genome Res. 2016; 26(7):990–999.
Alipanahi B, A Delong, MT Weirauch, BJ Frey, Predicting the sequence specificities of DNA-and RNA-binding proteins by deep learning,” Nat. Biotechnol., vol. 33, no. 8, pp. 831–838, 2015.
Angermueller C, Lee HJ, Reik W, Stegle O, Accurate prediction of single-cell DNA methylation states using deep learning. BioRxiv. 2016; 55715.
Koh PW, Pierson E, Kundaje A, Denoising genome-wide histone ChIP-seq with convolutional neural networks. Bioinformatics. 2017; 33(14):i225–i233.
Fakoor R, Ladhak F, Nazi A, Huber M, Using deep learning to enhance cancer diagnosis and classification. in Proceedings of the international conference on machine learning, 2013, vol. 28, pp. 3937–3949.
Lyons J, et al., Predicting backbone Cα angles and dihedrals from protein sequences by stacked sparse auto‐encoder deep neural network. J. Comput. Chem. 2014; 35(28):2040–2046.
Hammerla NY, Halloran S, Plötz T, Deep, convolutional, and recurrent models for human activity recognition using wearables. arXiv Prepr. arXiv1604.08880, 2016.
Zhu J, Pande A, Mohapatra P, Han JJ, Using deep learning for energy expenditure estimation with wearable sensors. in 2015 17th International Conference on E-health Networking, Application & Services (HealthCom), 2015; 501–506.
Jindal V, Birjandtalab J, Pouyan MB, Nourani M, An adaptive deep learning approach for PPG-based identification in 2016 38th Annual international conference of the IEEE engineering in medicine and biology society (EMBC), 2016; 6401–6404.
Nurse E, Mashford BS, Yepes AJ, Kiral-Kornek I, Harrer S, Freestone DR, Decoding EEG and LFP signals using deep learning: heading TrueNorth. in Proceedings of the ACM international conference on computing frontiers, 2016; 259–266.
Sathyanarayana A, et al., Sleep quality prediction from wearable data using deep learning. JMIR mHealth uHealth, 2016; 4(4):125.
Top Use Cases for AI in Media and Entertainment,” Dataiku. Sep. 22, 2021, [Online]. Available: https://www.dataiku.com/stories/ai-in-media-and-entertainment/.
Chou CH, Su YS, Hsu CJ, Lee KC, Han PH, Design of Desktop Audiovisual Entertainment System with Deep Learning and Haptic Sensations. Symmetry (Basel)., vol. 12, no. 10, 2020, DOI: 10.3390/sym12101718.
Erhel S, Jamet E, Digital game-based learning: Impact of instructions and feedback on motivation and learning effectiveness. Comput. Educ. 2013; 67:156–167. DOI: https://doi.org/10.1016/j.compedu.2013.02.019.
Justesen N, Bontrager P, Togelius J, Risi S, Deep learning for video game playing. IEEE Trans. Games. 2019; 12(1):1–20.
ParamitaGhosh, The Future of Deep Learning. DATAVERSITY. Sep. 22, 2020, [Online]. Available: https://www.dataversity.net/the-future-of-deep-learning/.
Gudmundsson SF, et al., Human-Like Playtesting with Deep Learning. in 2018 IEEE Conference on Computational Intelligence and Games (CIG), 2018; 1–8, DOI: 10.1109/CIG.2018.8490442.
Usukhbayar B, Homer S, Deepfake Videos: The Future of Entertainment. 2020.
Kulkarni R, Gaikwad R, Sugandhi R, Kulkarni P, Kone S, Survey on deep learning in music using GAN. Int. J. Eng. Res. Technol., vol. 8, no. 9, pp. 646–648, 2019.
Fessahaye F, et al., T-recsys: A novel music recommendation system using deep learning. in 2019 IEEE international conference on consumer electronics (ICCE), 2019; 1–6.
Nam J, Choi K, Lee J, Chou SY, Yang YH, Deep learning for audio-based music classification and tagging: Teaching computers to distinguish rock from bach. IEEE Signal Process. Mag. 2018; 36(1):41–51.
Sünderhauf N, et al., The limits and potentials of deep learning for robotics. Int. J. Rob. Res. 2018; 37(4–5):405–420.
Punjani A, Abbeel P, Deep learning helicopter dynamics models. IEEE Int. Conf. Robot. Autom., 2015; 3223–3230.
Miyajima R, Deep Learning Triggers a New Era in Industrial Robotics. IEEE Multimed. 2017; 24(4):91–96. DOI: 10.1109/MMUL.2017.4031311.
Le TT, Lin CY, Bin-Picking for Planar Objects Based on a Deep Learning Network: A Case Study of USB Packs. Sensors. 2019; 19(16) DOI: 10.3390/s19163602.
Correll N, et al., Analysis and Observations From the First Amazon Picking Challenge. IEEE Trans. Autom. Sci. Eng., 2018; 15(1):172–188, DOI: 10.1109/TASE.2016.2600527.
McLaughlin E, Charron N, Narasimhan S, Automated defect quantification in concrete bridges using robotics and deep learning. J. Comput. Civ. Eng. 2020; 34(5):4020029.
McLaughlin E, Charron N, Narasimhan S, Combining deep learning and robotics for automated concrete delamination assessment. in ISARC. Proceedings of the International Symposium on Automation and Robotics in Construction. 2019; 36:485–492.
How to Cite
Copyright (c) 2022 EAI Endorsed Transactions on AI and Robotics
This work is licensed under a Creative Commons Attribution 3.0 Unported License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.
Jiangxi University of Science and Technology
Grant numbers 205200100460