The Efficiency Cost-Sensitive Loss of Transformer based on Mamba Mechanism for Aircraft Detection in Satellite Imagery
DOI:
https://doi.org/10.4108/eetinis.v12i4.8279Keywords:
YOLO, Mamba, Transformer, Ghost Convolution, Wiou loss functionAbstract
Detecting aircraft in satellite images poses considerable challenges due to complex backgrounds and variable conditions influenced by sensor geometry and atmospheric factors. Despite rapid advancements in deep learning algorithms, their main focus has been on ground-based imagery. This study offers a thorough evaluation and comparison of advanced object detection algorithms specifically designed for aircraft detection in satellite imagery. By leveraging the extensive HRPlanesV2 dataset and a rigorous validation process on the GDIT dataset, we trained a cutting-edge object detection model, YOLO-Mamba, published in June 2024. Additionally, we introduce YOLO-Mamba-TransGhost, which integrates a novel Transformer module SC3T and Ghost Convolution into the YOLO model’s backbone architecture. Furthermore, substituting the WIoU loss function with CIoU in YOLO-Mamba results in significant improvements in accuracy and small object detection. Experimental results on the GDIT dataset indicate that YOLO-Mamba-TransGhost improves mAP@.5 by approximately 2% compared to the original YOLO-Mamba. Similarly, tests on the HRPlanev2 data set reveal a notable reduction in model complexity and an impressive accuracy of 98.7% which is achieved by leveraging a cost-sensitive loss function that dynamically focuses training on higher quality samples, improving convergence and accuracy. Therefore, the proposed YOLO-Mamba-TransGhost model demonstrates superior accuracy and reduced complexity in aircraft detection from satellite imagery, highlighting its potential for practical applications in aerospace monitoring, disaster management, and surveillance systems domain.
Downloads
References
[1] R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2014, pp. 580-587.
[2] R. Girshick, "Fast R-CNN," in Proceedings of the IEEE International Conference on Computer Vision, 2015, pp. 1440-1448.
[3] S. Ren, K. He, R. Girshick, and J. Sun, "Faster R- CNN: Towards real-time object detection with region proposal networks," IEEE transactions on pattern analysis and machine intelligence, vol. 39, no. 6, pp. 1137-1149, 2016.
[4] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg, "SSD: Single shot multibox detector," in Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I, vol. 9905 of Lecture Notes in Computer Science, Springer, 2016, pp. 21-37.
[5] J. Terven, D. M. Córdova-Esparza, and J. A. Romero- González, "A comprehensive review of YOLO architec- tures in computer vision: From YOLOv1 to YOLOv8 and YOLO-NAS," Machine Learning and Knowledge Extrac- tion, vol. 5, no. 4, pp. 1680-1716, 2023.
[6] J. Redmon, "You only look once: Unified, real-time object detection," in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016.
[7] T. L. C. Tran, Z. C. Huang, K. H. Tseng, and P. H. Chou, "Detection of Bottle Marine Debris Using Unmanned Aerial Vehicles and Machine Learning Techniques," Drones, vol. 6, no. 12, p. 401, 2022.
[8] D. Xu and Y. Wu, "Improved YOLO-V3 with DenseNet for multi-scale remote sensing target detection," Sensors, vol. 20, no. 15, p. 4276, 2020.
[9] L. Zhou, H. Yan, Y. Shan, C. Zheng, Y. Liu, X. Zuo, and B. Qiao, "Aircraft detection for remote sensing images based on deep convolutional neural networks," Journal of Electrical and Computer Engineering, vol. 2021, no. 1, p. 4685644, 2021.
[10] Y. Yang, G. Xie, and Y. Qu, "Real-time detection of aircraft objects in remote sensing images based
on improved YOLOv4," in 2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), 2021, pp. 1156-1164.
[11] Y. Kun, H. Man, and Y. L. Yanling, "Multi-target detection in airport scene based on YOLOv5," in 2021 IEEE 3rd International Conference on Civil Aviation Safety and Information Technology (ICCASIT), 2021, pp. 1175- 1177.
[12] Y. Li and X. Zhang, "Object detection for UAV images based on improved YOLOv6," IAENG International Journal of Computer Science, vol. 50, no. 2, pp. 759-768, 2023.
[13] M. T. Do, M. H. Ha, D. C. Nguyen, K. Thai, and Q. H. Do Ba, "Human Detection Based YOLO Backbones- Transformer in UAVs," in 2023 International Conference on System Science and Engineering (ICSSE), 2023, pp. 576- 580.
[14] M. Bakirci and I. Bayraktar, "Transforming aircraft detection through LEO satellite imagery and YOLOv9 for improved aviation safety," in 2024 26th International Conference on Digital Signal Processing and its Applications (DSPA), 2024, pp. 1-6.
[15] A. Wang, H. Chen, L. Liu, K. Chen, Z. Lin, J. Han, and G. Ding, "YOLOv10: Real-time end-to-end object detection," arXiv preprint arXiv:2405.14458, 2024.
[16] Z. Wang, C. Li, H. Xu, and X. Zhu, "Mamba YOLO: SSMs-Based YOLO For Object Detection," arXiv preprint arXiv:2406.05835, 2024.
[17] Z. Tong, Y. Chen, Z. Xu, and R. Yu, "Wise-IoU: bounding box regression loss with dynamic focusing mechanism," arXiv preprint arXiv:2301.10051, 2023.
[18]M. T. Do, T. D. Kim, M. H. Ha, O. T. C. Chen, D. C. Nguyen, and A. L. Q. Tran, "An Effective Method for Detecting Personal Protective Equipment at Real Construction Sites Using the Improved YOLOv5s with SIoU Loss Function," in 2023 RIVF International Conference on Computing and Communication Technologies (RIVF), 2023, pp. 430-434.
[19] K. Han, Y. Wang, Q. Tian, J. Guo, C. Xu, and C. Xu, "Ghostnet: More features from cheap operations," in Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2020, pp. 1580-1589.
[20] S. E. Ghazouali, A. Gucciardi, N. Venturi, M. Rueegseg- ger, and U. Michelucci, "FlightScope: A Deep Compre- hensive Assessment of Aircraft Detection Algorithms in Satellite Imagery," arXiv preprint arXiv:2404.02877, 2024.
[21] T.BakirmanandE.Sertel,"Abenchmarkdatasetfordeep learning-based airplane detection: HRPlanes," arXiv preprint arXiv:2204.10959, 2022.
[22] G. H. Rishi, P. R. Kumar, N. C. Varshit, and K. Sailaja, "Real-Time Military Aircraft Detection using YOLOv5," in 2024 Second International Conference on Data Science and Information System (ICDSIS), 2024, pp. 1-7.
[23] Medium, "YOLO-NAS: A game-changer in object detec- tion with Deci AI’s neural architecture search technol- ogy," Medium, 2024.
Downloads
Published
How to Cite
Issue
Section
Categories
License
Copyright (c) 2025 Manh-Tuan Do, Manh-Hung Ha, Minh-Huy Le, Oscal Tzyh-Chiang

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open-access article distributed under the terms of the Creative Commons Attribution CC BY 3.0 license, which permits unlimited use, distribution, and reproduction in any medium so long as the original work is properly cited.