Survey of Accuracy Prediction on the PlantVillage Dataset using different ML techniques


  • Vaishnavi Pandey Pranveer Singh Institute of Technology
  • Utkarsh Tripathi Pranveer Singh Institute of Technology
  • Vimal Kumar Singh Pranveer Singh Institute of Technology
  • Youvraj Singh Gaur Pranveer Singh Institute of Technology
  • Deepak Gupta Pranveer Singh Institute of Technology



AlexNet, Transfer Learning, Deep-Learning, Plant Disease Detection, GoogleNet, RCNN


A plant is susceptible to numerous illnesses while it is growing. The early detection of plant illnesses is one of the most serious problems in agriculture. Plant disease outbreaks may have a remarkable impact on crop yield, slowing the rate of the nation's economic growth. Early plant disease detection and treatment are possible using deep learning, computer-vision, and ML techniques. The methods used for the categorization of plant diseases even outperformed human performance and conventional image-processing-based methods. In this context, we review 48 works over the last five years that address problems with disease detection, dataset properties, the crops under study, and pathogens in various ways. The research results discussed in this paper, with a focus on work published between 2015 and 2023, demonstrate that among numerous techniques (MobileNetV2, K-Means+GLCM+SVM, Residual Teacher-Student CNN, SVM+K-Means+ANN, AlexNet, AlexNet with Learning from Scratch, AlexNet with Transfer Learning, VGG16, GoogleNet with Training from Scratch, GoogleNet with Transfer Learning) applied on the PlantVillage Dataset, the architecture AlexNet with Transfer Learning identified diseases with the highest accuracy.


Download data is not yet available.
<br data-mce-bogus="1"> <br data-mce-bogus="1">


Mathur, Archana S., Surajit Das, and Subhalakshmi Sircar. "Status of agriculture in India: trends and prospects." Economic and political weekly (2006): 5327-5336.

Nazarov, Pavel A., et al. "Infectious plant diseases: Etiology, current status, problems and prospects in plant protection." Acta naturae 12.3 (2020): 46. DOI:

Moshou, Dimitrios, et al. "Plant disease detection based on data fusion of hyper-spectral and multi-spectral fluorescence imaging using Kohonen maps." Real-Time Imaging 11.2 (2005): 75-83. DOI:

Naikwadi, Smita, and Niket Amoda. "Advances in image processing for detection of plant diseases." International Journal of Application or Innovation in Engineering & Management 2.11 (2013).

Dhaygude, Sanjay B., and Nitin P. Kumbhar. "Agricultural plant leaf disease detection using image processing." International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering 2, no. 1 (2013): 599-602.

Singh, Vijai, and A. K. Misra. "Detection of unhealthy region of plant leaves using image processing and genetic algorithm." 2015 International Conference on Advances in Computer Engineering and Applications. IEEE, 2015. DOI:

Mohanty, Sharada P., David P. Hughes, and Marcel Salathé. "Using deep learning for image-based plant disease detection." Frontiers in plant science 7 (2016): 1419. DOI:

Ferentinos, Konstantinos P. "Deep learning models for plant disease detection and diagnosis." Computers and electronics in agriculture 145 (2018): 311-318. DOI:

Rangarajan, Aravind Krishnaswamy, Raja Purushothaman, and Aniirudh Ramesh. "Tomato crop disease classification using pre-trained deep learning algorithm." Procedia computer science 133 (2018): 1040-1047. DOI:

Wu, Qiufeng, Yiping Chen, and Jun Meng. "DCGAN-based data augmentation for tomato leaf disease identification." IEEE Access 8 (2020): 98716-98728. DOI:

Zhang, Yang, Chenglong Song, and Dongwen Zhang. "Deep learning-based object detection improvement for tomato disease." IEEE access 8 (2020): 56607-56614. DOI:

Ai, Yong, et al. "Research on recognition model of crop diseases and insect pests based on deep learning in harsh environments." IEEE Access 8 (2020): 171686-171693. DOI:

V Suresh, V., et al. “Plant Disease Detection Using Image Processing.” International Journal of Engineering Research and Technology, vol. V9, no. 03, International Research Publication House, 13 Mar. 2020. DOI:

Tiwari, Divyansh, et al. "Potato leaf diseases detection using deep learning." 2020 4th international conference on intelligent computing and control systems (ICICCS). IEEE, 2020. DOI:

Nair, Arathi, et al. "Smart Farming and Plant Disease Detection using IoT and ML." International Journal of Engineering Research & Technology, NCREIS-2021 Conference Proceedings. 2021.

Ahmad, Mobeen, et al. "Plant disease detection in imbalanced datasets using efficient convolutional neural networks with stepwise transfer learning." IEEE Access 9 (2021): 140565-140580. DOI:

Thaiyalnayaki, K., and Christeena Joseph. "Classification of plant disease using SVM and deep learning." Materials Today: Proceedings 47 (2021): 468-470. DOI:

Trivedi, Naresh K., et al. "Early detection and classification of tomato leaf disease using high-performance deep neural network." Sensors 21.23 (2021): 7987. DOI:

Bedi, Punam, and Pushkar Gole. "Plant disease detection using hybrid model based on convolutional autoencoder and convolutional neural network." Artificial Intelligence in Agriculture 5 (2021): 90-101. DOI:

Chowdhury, Muhammad EH, et al. "Automatic and reliable leaf disease detection using deep learning techniques." AgriEngineering 3.2 (2021): 294-312. DOI:

Shrimali, Samyak. "Plantifyai: a novel convolutional neural network based mobile application for efficient crop disease detection and treatment." Procedia Computer Science 191 (2021): 469-474. DOI:

Harakannanavar, Sunil S., et al. "Plant leaf disease detection using computer vision and machine learning algorithms." Global Transitions Proceedings 3.1 (2022): 305-310. DOI:

Sunil, C. K., C. D. Jaidhar, and Nagamma Patil. "Cardamom plant disease detection approach using EfficientNetV2." IEEE Access 10 (2021): 789-804. DOI:

Ahmed, Sabbir, et al. "Less is more: Lighter and faster deep neural architecture for tomato leaf disease classification." IEEE Access 10 (2022): 68868-68884. DOI:

Özbılge, Emre, Mehtap Köse Ulukök, Önsen Toygar, and Ebru Ozbılge. "Tomato Disease Recognition Using a Compact Convolutional Neural Network." IEEE Access 10 (2022): 77213-77224. DOI:

Pandian, J. Arun, et al. "A five convolutional layer deep convolutional neural network for plant leaf disease detection." Electronics 11.8 (2022): 1266. DOI:

Kundu, Nidhi, Geeta Rani, Vijaypal Singh Dhaka, Kalpit Gupta, Siddaiah Chandra Nayaka, Eugenio Vocaturo, and Ester Zumpano. "Disease detection, severity prediction, and crop loss estimation in MaizeCrop using deep learning." Artificial Intelligence in Agriculture 6 (2022): 276-291. DOI:

Batchuluun, Ganbayar, Se Hyun Nam, and Kang Ryoung Park. "Deep learning-based plant classification and crop disease classification by thermal camera." Journal of King Saud University-Computer and Information Sciences 34, no. 10 (2022): 10474-10486. DOI:

Rani, Pushpa Athisaya Sakila, and N. Suresh Singh. "Paddy leaf symptom-based disease classification using deep CNN with ResNet-50." International Journal of Advanced Science Computing and Engineering 4.2 (2022): 88-94. DOI:

Zhao, Xiaohu, et al. "The continuous wavelet projections algorithm: A practical spectral-feature-mining approach for crop detection." The Crop Journal 10.5 (2022): 1264-1273. DOI:

Umamageswari, A., S. Deepa, and K. Raja. "An enhanced approach for leaf disease identification and classification using deep learning techniques." Measurement: Sensors 24 (2022): 100568. DOI:

Shah, Dhruvil, et al. "ResTS: Residual deep interpretable architecture for plant disease detection." Information Processing in Agriculture 9.2 (2022): 212-223. DOI:

Anwarul, Shahina, Manya Mohan, and Radhika Agarwal. "An Unprecedented Approach for Deep Learning Assisted Web Application to Diagnose Plant Disease." Procedia Computer Science 218 (2023): 1444-1453. DOI:

Bensaadi, Soumia, and Ahmed Louchene. "Low-cost convolutional neural network for tomato plant diseases classifiation." IAES International Journal of Artificial Intelligence 12.1 (2023): 162. DOI:

Meena, S. Divya, et al. "Crop Yield Improvement with Weeds, Pest and Disease Detection." Procedia Computer Science 218 (2023): 2369-2382. DOI:

Datta, Saikat, and Nitin Gupta. "A novel approach for the detection of tea leaf disease using deep neural network." Procedia Computer Science 218 (2023): 2273-2286. DOI:

Falaschetti, Laura, et al. "A CNN-based image detector for plant leaf diseases classification." HardwareX 12 (2022): e00363. DOI:

Anim-Ayeko, Alberta Odamea, Calogero Schillaci, and Aldo Lipani. "Automatic blight disease detection in potato (Solanum tuberosum L.) and tomato (Solanum lycopersicum, L. 1753) plants using deep learning." Smart Agricultural Technology 4 (2023): 100178. DOI:

Godtliebsen, Fred, James Stephen Marron, and Probal Chaudhuri. "Statistical significance of features in digital images." Image and Vision Computing 22.13 (2004): 1093-1104. DOI:

Callens, Nicolas, and Georges GE Gielen. "Analysis and comparison of readout architectures and analog-to-digital converters for 3D-stacked CMOS image sensors." IEEE Transactions on Circuits and Systems I: Regular Papers 68.8 (2021): 3117-3130. DOI:

Suzuki, Kenji. "Overview of deep learning in medical imaging." Radiological physics and technology 10.3 (2017): 257-273. DOI:

Marceau, D. J., and G. J. Hay. "Contributions of remote sensing to the scale issues." Canadian Journal of Remote Sensing 25.4 (1999): 357-366. DOI:

Amza, Catalin Gheorghe, and Dumitru Titi Cicic. "Industrial image processing using fuzzy-logic." Procedia Engineering 100 (2015): 492-498.

Amza, Catalin Gheorghe, and Dumitru Titi Cicic. "Industrial image processing using fuzzy-logic." Procedia Engineering 100 (2015): 492-498. DOI:

Mait, Joseph N., Gary W. Euliss, and Ravindra A. Athale. "Computational imaging." Advances in Optics and Photonics 10.2 (2018): 409-483. DOI:

Kandel, Ibrahem, and Mauro Castelli. "The effect of batch size on the generalizability of the convolutional neural networks on a histopathology dataset." ICT express 6.4 (2020): 312-315. DOI:

Yang, Jing, et al. "Interactive hierarchical dimension ordering, spacing and filtering for exploration of high dimensional datasets." IEEE Symposium on Information Visualization 2003 (IEEE Cat. No. 03TH8714). IEEE, 2003.

Coates, Adam, and Andrew Y. Ng. "The importance of encoding versus training with sparse coding and vector quantization." Proceedings of the 28th international conference on machine learning (ICML-11). 2011.

Gupta, Shivani, and Atul Gupta. "Dealing with noise problem in machine learning data-sets: A systematic review." Procedia Computer Science 161 (2019): 466-474. DOI:

Stasaski, Katherine, Grace Hui Yang, and Marti A. Hearst. "More diverse dialogue datasets via diversity-informed data collection." Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020. DOI:

Paullada, Amandalynne, et al. "Data and its (dis) contents: A survey of dataset development and use in machine learning research." Patterns 2.11 (2021). DOI:

Kjamilji, Artrim, Erkay Savaş, and Albert Levi. "Efficient secure building blocks with application to privacy preserving machine learning algorithms." IEEE Access 9 (2021): 8324-8353. DOI:

Nisa, Israt, et al. "Effective machine learning based format selection and performance modeling for SpMV on GPUs." 2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). IEEE, 2018. DOI:

Polignano, Marco, et al. "A comparison of word-embeddings in emotion detection from text using bilstm, cnn and self-attention." Adjunct Publication of the 27th Conference on User Modeling, Adaptation and Personalization. 2019. DOI:

Chen, Bodong, et al. "Two tales of time: Uncovering the significance of sequential patterns among contribution types in knowledge-building discourse." Interactive Learning Environments 25.2 (2017): 162-175. DOI:

Schmidt, Kenneth A., Sasha RX Dall, and Jan A. Van Gils. "The ecology of information: an overview on the ecological significance of making informed decisions." Oikos 119.2 (2010): 304-316. DOI:

Huang, Zhengxing, Xudong Lu, and Huilong Duan. "Context-aware recommendation using rough set model and collaborative filtering." Artificial Intelligence Review 35 (2011): 85-99. DOI:

Du, Bo, et al. "Exploring representativeness and informativeness for active learning." IEEE transactions on cybernetics 47.1 (2015): 14-26. DOI:

Dunnington, Dewey W., et al. "Comparing the Predictive performance, interpretability, and accessibility of machine learning and physically based models for water treatment." ACS ES&T Engineering 1.3 (2020): 348-356. DOI:

Houston, Andrew, and Georgina Cosma. "A genetically-optimised artificial life algorithm for complexity-based synthetic dataset generation." Information Sciences 619 (2023): 540-561. DOI:

Famili, A., et al. "Data preprocessing and intelligent data analysis." Intelligent data analysis 1.1 (1997): 3-23. DOI:

Dwivedi, Sanjay Kumar, and Bhupesh Rawat. "A review paper on data preprocessing: a critical phase in web usage mining process." 2015 International Conference on Green Computing and Internet of Things (ICGCIoT). IEEE, 2015. DOI:

García, Salvador, et al. "Big data preprocessing: methods and prospects." Big Data Analytics 1.1 (2016): 1-22. DOI:

Eler, Danilo Medeiros, et al. "Analysis of document pre-processing effects in text and opinion mining." Information 9.4 (2018): 100.0 DOI:

Agrawal, Shikha, and Jitendra Agrawal. "Survey on Anomaly Detection Using Data Mining Techniques." Procedia Computer Science, vol. 60, 2014, pp. 708-713, Accessed 20 Oct. 2023. DOI:

Dahouda, Mwamba Kasongo, and Inwhee Joe. "A deep-learned embedding technique for categorical features encoding." IEEE Access 9 (2021): 114381-114391. DOI:

Cha, Young‐Jin, et al. "Autonomous structural visual inspection using region‐based deep learning for detecting multiple damage types." Computer‐Aided Civil and Infrastructure Engineering 33(9): 731-747. DOI:

Takahashi, Ryo, Takashi Matsubara, and Kuniaki Uehara. "Data augmentation using random image cropping and patching for deep CNNs." IEEE Transactions on Circuits and Systems for Video Technology 30(9): 2917-2931. DOI:

Fadaee, Marzieh, Arianna Bisazza, and Christof Monz. "Data augmentation for low-resource neural machine translation." arXiv preprint arXiv:1705.00440 (2017). DOI:

Zini, Simone, et al. "Planckian Jitter: countering the color-crippling effects of color jitter on self-supervised training." arXiv preprint arXiv:2202.07993 (2022).

Eren Akbiyik, M. "Data Augmentation in Training CNNs: Injecting Noise to Images." arXiv e-prints (2023): arXiv-2307.

Ruth Fong, Mandela Patrick, Andrea Vedaldi; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, p. 2950-2958.

E. Castro, J. S. Cardoso and J. C. Pereira, "Elastic deformations for data augmentation in breast cancer mass detection," 2018 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), Las Vegas, NV, USA, 2018, pp. 230-234, doi: 10.1109/BHI.2018.8333411. DOI:

Retrieved from

Retrieved from

Retrieved from

Retrieved from

Retrieved from

Boulent, Justine, et al. "Convolutional Neural Networks for the Automatic Identification of Plant Diseases." Frontiers in Plant Science, vol. 10, 2019, p. 464450, Accessed 20 Oct. 2023. DOI:

W. M. Jinjri, P. Keikhosrokiani and N. L. Abdullah, "Machine Learning Algorithms for The Classification of Cardiovascular Disease- A Comparative Study," 2021 International Conference on Information Technology (ICIT), Amman, Jordan, 2021, pp. 132-138, doi: 10.1109/ICIT52682.2021.9491677. DOI:

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. 2017. ImageNet classification with deep convolutional neural networks. Commun. ACM 60, 6 (June 2017), 84–90. DOI:

Simonyan, Karen, and Andrew Zisserman. "Very Deep Convolutional Networks for Large-Scale Image Recognition." ArXiv, 2014, /abs/1409.1556. Accessed 20 Oct. 2023.

Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015, p. 1-9

Howard, Andrew G., et al. "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications." ArXiv, 2017, /abs/1704.04861. Accessed 20 Oct. 2023.

Mark Sandler, Andrew Howard, Menglong Zhu, Andrey Zhmoginov, Liang-Chieh Chen; Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018, p. 4510-4520

Andrew Howard, Mark Sandler, Grace Chu, Liang-Chieh Chen, Bo Chen, Mingxing Tan, Weijun Wang, Yukun Zhu, Ruoming Pang, Vijay Vasudevan, Quoc V. Le, Hartwig Adam; Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2019, pp. 1314-1324

Tan, M. & Le, Q.. (2019). EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks.Proceedings of the 36th International Conference on Machine Learning, in Proceedings of Machine Learning Research 97:6105-6114 Available from

K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770-778, doi: 10.1109/CVPR.2016.90. DOI:

D. R. Wilson and T. R. Martinez, "The need for small learning rates on large problems," IJCNN'01. International Joint Conference on Neural Networks. Proceedings (Cat. No.01CH37222), Washington, DC, USA, 2001, pp. 115-119 vol.1, doi: 10.1109/IJCNN.2001.939002. DOI:

Afaq, Saahil, and Smitha Rao. "Significance of epochs on training a neural network." Int. J. Sci. Technol. Res 9(06): 485-488.

Radiuk P. M. Impact of training set batch size on the performance of convolutional neural networks for diverse datasets // Information Technology and Management Science. 2017. Vol. 20, No 1. P. 20-24. DOI:

Yang, Xin-She. "Optimization algorithms." Computational optimization, methods and algorithms. Berlin, Heidelberg: Springer Berlin Heidelberg, 2011. 13-31. DOI:

Janocha, Katarzyna, and Wojciech M. Czarnecki. "On Loss Functions for Deep Neural Networks in Classification." ArXiv, 2017, /abs/1702.05659. Accessed 20 Oct. 2023.

Janocha, Katarzyna, and Wojciech M. Czarnecki. "On Loss Functions for Deep Neural Networks in Classification." ArXiv, 2017, /abs/1702.05659. Accessed 20 Oct. 2023.

Hossin, Mohammad, and Md Nasir Sulaiman. "A review on evaluation metrics for data classification evaluations." International journal of data mining & knowledge management process 5(2): 1.

Kumar, Pradumn, and Upasana Dugal. "Tensorflow based image classification using advanced convolutional neural network." International Journal of Recent Technology and Engineering (IJRTE) 8(6): 994-998. DOI:




How to Cite

V. Pandey, U. Tripathi, V. K. Singh, Y. S. Gaur, and D. Gupta, “Survey of Accuracy Prediction on the PlantVillage Dataset using different ML techniques ”, EAI Endorsed Trans IoT, vol. 10, Dec. 2023.