Effective Object detection and Tracking using Attention-Driven YOLO v9 Model with Multi-Stage Cascaded Convolutional Model
DOI:
https://doi.org/10.4108/eetiot.8231Keywords:
Object Detection, Tracking, Vehicle, CNN, YOLO v9, Deep learning, Attention mechanism, Spatial mechanismAbstract
INTRODUCTION: Object detection and tracking are essential for computer vision, particularly for vehicle monitoring within digital images and video streams. Traditional methods, such as background subtraction and template matching, rely on heuristic algorithm and handcrafted features, which often struggles with diverse vehicle appearance and complex backgrounds. These techniques, while foundational, exhibit limitations in flexibility and scalability, resulting in lower accuracy and high computational costs.
OBJECTIVES: In contrast, advanced Deep Learning (DL) approaches, particularly those utilizing Conventional Neural Network (CNNs), have revolutionized the field by enabling automatic feature extraction from large datasets. Despite their advantages, existing DL models like You Only Look Once (YOLO) face challenges in detecting small or closely packed vehicles and can be computationally intensive.
METHODS: This study proposed an Attention Driven YOLO v9 architecture that integrates with a proposed mechanism combining spatial and channel attention to detect the small size vehicle accurately.
RESULTS: Additionally the architecture incorporates multi stage cascaded convolution layers to enhance the feature extraction and robustness against occlusion and background noise. The model is trained using the UA-DETRAC dataset, providing a rich set of images for learning.
CONCLUSION: Performance evaluation metric such as Mean Average Precision (mAP), precision, recall, and tracking accuracy demonstrating significant improvement over traditional methods and existing state of the art models. This research contributes to the field by addressing the limitations of previous studies through technique to speed and accuracy in vehicle detection and tracking.
Downloads
References
[1] M. T. Hosain, A. Zaman, M. R. Abir, S. Akter, S. Mursalin, and S. S. Khan, "Synchronizing Object Detection: Applications, Advancements and Existing Challenges," IEEE Access, 2024.
[2] A. S. Patel, R. Vyas, O. Vyas, M. Ojha, and V. Tiwari, "Motion-compensated online object tracking for activity detection and crowd behavior analysis," The Visual Computer, vol. 39, no. 5, pp. 2127-2147, 2023.
[3] H. Alqahtani and G. Kumar, "Machine learning for enhancing transportation security: A comprehensive analysis of electric and flying vehicle systems," Engineering Applications of Artificial Intelligence, vol. 129, p. 107667, 2024.
[4] S. Kurian, M. Faiz, M. Zubair, and S. Ahamed, "Advancing Safety and Efficiency in Human-Robot Interaction," in 2024 International Conference on Distributed Computing and Optimization Techniques (ICDCOT), 2024: IEEE, pp. 1-8.
[5] S. Kanagamalliga, P. Kovalan, K. Kiran, and S. Rajalingam, "Traffic Management through Cutting-Edge Vehicle Detection, Recognition, and Tracking Innovations," Procedia Computer Science, vol. 233, pp. 793-800, 2024.
[6] L. Zhang, W. Xu, C. Shen, and Y. Huang, "Vision-based on-road nighttime vehicle detection and tracking using improved HOG features," Sensors, vol. 24, no. 5, p. 1590, 2024.
[7] V. K. Patil, P. Nawade, R. Nagarkar, and P. Kadale, "Object Detection and Tracking Face Detection and Recognition," Integrating Metaheuristics in Computer Vision for Real‐World Optimization Problems, pp. 25-54, 2024.
[8] D.-y. Ge, X.-f. Yao, W.-j. Xiang, and Y.-p. Chen, "Vehicle detection and tracking based on video image processing in intelligent transportation system," Neural Computing and Applications, vol. 35, no. 3, pp. 2197-2209, 2023.
[9] M. Zohaib, M. Asim, and M. ELAffendi, "Enhancing Emergency Vehicle Detection: A Deep Learning Approach with Multimodal Fusion," Mathematics, vol. 12, no. 10, p. 1514, 2024.
[10] M. Rasheed, M. Kaleem, M. A. Mushtaq, N. Ahmed, S. Rasheed, and S. Batool, "Deep Learning Approaches for Classification of Vehicles in Intelligent Transportation Systems," 2024.
[11] T. Diwan, G. Anirudh, and J. V. Tembhurne, "Object detection using YOLO: Challenges, architectural successors, datasets and applications," multimedia Tools and Applications, vol. 82, no. 6, pp. 9243-9275, 2023.
[12] A. A. Mustapha and M. S. Yoosuf, "Exploring the efficacy and comparative analysis of one-stage object detectors for computer vision: a review," Multimedia Tools and Applications, vol. 83, no. 20, pp. 59143-59168, 2024.
[13] A. Thakur and S. K. Mishra, "An in-depth evaluation of deep learning-enabled adaptive approaches for detecting obstacles using sensor-fused data in autonomous vehicles," Engineering Applications of Artificial Intelligence, vol. 133, p. 108550, 2024.
[14] J. Zhai, B. Li, S. Lv, and Q. Zhou, "FPGA-based vehicle detection and tracking accelerator," Sensors, vol. 23, no. 4, p. 2208, 2023.
[15] M. Carranza-García, J. Torres-Mateo, P. Lara-Benítez, and J. García-Gutiérrez, "On the performance of one-stage and two-stage object detectors in autonomous vehicles using camera data," Remote Sensing, vol. 13, no. 1, p. 89, 2020.
[16] B. Karbouj, G. A. Topalian-Rivas, and J. Krüger, "Comparative Performance Evaluation of One-Stage and Two-Stage Object Detectors for Screw Head Detection and Classification in Disassembly Processes," Procedia CIRP, vol. 122, pp. 527-532, 2024.
[17] N. G. Rani, N. H. Priya, A. Ahilan, and N. Muthukumaran, "LV-YOLO: Logistic vehicle speed detection and counting using deep learning based YOLO network," Signal, Image and Video Processing, vol. 18, no. 10, pp. 7419-7429, 2024.
[18] J. Wilson and A. York, "Enhancing Road Traffic Surveillance: Deep Learning Techniques for Vehicle Detection and Tracking," Journal of Computer Technology and Software, vol. 1, no. 1, pp. 5-9, 2024.
[19] U. Mittal, P. Chawla, and R. Tiwari, "EnsembleNet: A hybrid approach for vehicle detection and estimation of traffic density based on faster R-CNN and YOLO models," Neural Computing and Applications, vol. 35, no. 6, pp. 4755-4774, 2023.
[20] K. SP and P. Mohandas, "DETR-SPP: a fine-tuned vehicle detection with transformer," Multimedia Tools and Applications, vol. 83, no. 9, pp. 25573-25594, 2024.
[21] Y. Sakagawa, K. Nakajima, and G. Ohashi, "Vision based nighttime vehicle detection using adaptive threshold and multi-class classification," IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, vol. 102, no. 9, pp. 1235-1245, 2019.
[22] Z. Qiu, H. Bai, and T. Chen, "Special vehicle detection from UAV perspective via YOLO-GNS based deep learning network," Drones, vol. 7, no. 2, p. 117, 2023.
[23] A. Farid, F. Hussain, K. Khan, M. Shahzad, U. Khan, and Z. Mahmood, "A fast and accurate real-time vehicle detection method using deep learning for unconstrained environments," Applied Sciences, vol. 13, no. 5, p. 3059, 2023.
[24] T. Barbu, S.-I. Bejinariu, and R. Luca, "Automatic Vehicle Detection and Tracking using Cascade Classifiers and CNN-based Multi-scale Feature Extraction."
[25] X. Han, "Modified cascade RCNN based on contextual information for vehicle detection," Sensing and Imaging, vol. 22, no. 1, p. 19, 2021.
[26] P. Jagannathan, S. Rajkumar, J. Frnda, P. B. Divakarachari, and P. Subramani, "Moving vehicle detection and classification using gaussian mixture model and ensemble deep learning technique," Wireless Communications and Mobile Computing, vol. 2021, no. 1, p. 5590894, 2021.
[27] Y. Zhang, Z. Guo, J. Wu, Y. Tian, H. Tang, and X. Guo, "Real-time vehicle detection based on improved yolo v5," Sustainability, vol. 14, no. 19, p. 12274, 2022.
[28] J. Kim, S. Hong, and E. Kim, "Novel on-road vehicle detection system using multi-stage convolutional neural network," IEEE Access, vol. 9, pp. 94371-94385, 2021.
[29] D. M. Chilukuri, S. Yi, and Y. Seong, "A robust object detection system with occlusion handling for mobile devices," Computational Intelligence, vol. 38, no. 4, pp. 1338-1364, 2022.
[30] J. Wang, Y. Dong, S. Zhao, and Z. Zhang, "A high-precision vehicle detection and tracking method based on the attention mechanism," Sensors, vol. 23, no. 2, p. 724, 2023.
[31] Z. Liu, W. Han, H. Xu, K. Gong, Q. Zeng, and X. Zhao, "Research on vehicle detection based on improved YOLOX_S," Scientific reports, vol. 13, no. 1, p. 23081, 2023.
[32] S. Sutikno, A. Sugiharto, R. Kusumaningrum, and H. A. Wibawa, "Improved car detection performance on highways based on YOLOv8," Bulletin of Electrical Engineering and Informatics, vol. 13, no. 5, pp. 3526-3533, 2024.
[33] M. M. Rana, M. S. Hossain, M. M. Hossain, and M. D. Haque, "Improved vehicle detection: unveiling the potential of modified YOLOv5," Discover Applied Sciences, vol. 6, no. 7, p. 332, 2024.
[34] J. He, H. Chen, B. Liu, S. Luo, and J. Liu, "Enhancing YOLO for occluded vehicle detection with grouped orthogonal attention and dense object repulsion," Scientific Reports, vol. 14, no. 1, p. 19650, 2024.
[35] T. Deng, X. Liu, and L. Wang, "Occluded vehicle detection via multi-scale hybrid attention mechanism in the road scene," Electronics, vol. 11, no. 17, p. 2709, 2022.
[36] M. A. Z. Al Bayati and M. Çakmak, "Real-Time Vehicle Detection for Surveillance of River Dredging Areas Using Convolutional Neural Networks," Int. J. Image Graph. Signal Process, vol. 15, pp. 17-28, 2023.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 Krishna Mohan A, P. V. N. Reddy, K. Satyma Prasad

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open-access article distributed under the terms of the Creative Commons Attribution CC BY 3.0 license, which permits unlimited use, distribution, and reproduction in any medium so long as the original work is properly cited.