FDD-YOLO: A Lightweight Multi-scale Prohibited Items Detection Model
DOI:
https://doi.org/10.4108/airo.10277Keywords:
Frequency Domain Decomposition Network (FDDN), Deformable Elastic Fusion Pyramid (DEFP), Dual-channel Convolution (DualConv), Prohibited Items, X-ray ImageAbstract
X-ray security inspection faces challenges such as severe occlusion, scale variation, and complex background when detecting prohibited items, requiring real-time and accurate detection. Although the YOLO series of models has high inference efficiency, they suffer from problems such as feature redundancy, insufficient fine-grained feature extraction, and limited adaptability to overlapping objects. To overcome these limitations, we propose FDD-YOLO and design three novel modules: (1) The Frequency Domain Decomposition Network (FDDN) in the backbone network enhances the edges of metal objects and the contours of liquid containers by decomposing high-frequency and low-frequency features while reducing computational redundancy; (2) The Deformable Elastic Fusion Pyramid (DEFP) in the neck network adopts dynamic channel allocation and multi-scale deformable convolution to handle the geometric changes of folded and overlapping objects; (3) The lightweight Dual-channel Convolution (DualConv) improves multi-scale feature capture through grouping and point-by-point convolution, thereby reducing the number of parameters while improving the accuracy of small object detection. Tests on the SIXray, HIXray, and private GIX datasets show that FDD-YOLO achieves 2.6%, 3.2%, and 8.6% higher mAP than YOLOv11n, respectively, achieving accuracies of 94.8%, 84%, and 71.8%, respectively. This framework also reduces the number of parameters by 30.6% and the number of FLOPs by 26.9%, achieving an optimal balance between accuracy and efficiency, setting a new technical benchmark for real-time security inspections.
Downloads
References
[1] Li M, Jia T, Wang H, et al. Ao-detr: Anti-overlapping detr for x-ray prohibited items detection[J]. IEEE Transactions on Neural Networks and Learning Systems, 2024.
[2] Hassan T, Bettayeb M, Akçay S, et al. Detecting prohibited items in X-ray images: A contour proposal learning approach[C]//2020 IEEE International Conference on Image Processing (ICIP). IEEE, 2020: 2016-2020.
[3] Hättenschwiler N, Sterchi Y, Mendes M, et al. Automation in airport security X-ray screening of cabin baggage: Examining benefits and possible implementations of automated explosives detection[J]. Applied ergonomics, 2018, 72: 58-68.
[4] Ren S, He K, Girshick R, et al. Faster R-CNN: Towards real-time object detection with region proposal networks[J]. IEEE transactions on pattern analysis and machine intelligence, 2016, 39(6): 1137-1149.
[5] Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 779-788.
[6] Liu W, Anguelov D, Erhan D, et al. Ssd: Single shot multibox detector[C]//Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer International Publishing, 2016: 21-37.
[7] Ou X, Chen X, Xu X, et al. Recent development in x-ray imaging technology: Future and challenges[J]. Research, 2021.
[8] Wang H, Jia T, Ma B, et al. Delving into cluttered prohibited item detection for security inspection system[J]. IEEE Transactions on Industrial Informatics, 2024, 20(10): 11825-11834.
[9] Lin T Y, Dollár P, Girshick R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2017: 2117-2125.
[10] Wang M, Du H, Mei W, et al. Weight-guided dual-direction-fusion feature pyramid network for prohibited item detection in x-ray images[J]. Journal of Electronic Imaging, 2022, 31(3): 033032-033032.
[11] Guo M H, Xu T X, Liu J J, et al. Attention mechanisms in computer vision: A survey[J]. Computational visual media, 2022, 8(3): 331-368.
[12] Abbasi S, Mohammadzadeh M, Zamzamian M. A novel dual high-energy X-ray imaging method for materials discrimination[J]. Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, 2019, 930: 82-86.
[13] Xie X, Cheng G, Wang J, et al. Oriented R-CNN and beyond[J]. International Journal of Computer Vision, 2024, 132(7): 2420-2442.
[14] Zhang W, Zhu Q, Li Y, et al. MAM Faster R-CNN: Improved Faster R-CNN based on Malformed Attention Module for object detection on X-ray security inspection[J]. Digital Signal Processing, 2023, 139: 104072.
[15] Sagar A S M S, Chen Y, Xie Y K, et al. MSA R-CNN: A comprehensive approach to remote sensing object detection and scene understanding[J]. Expert Systems with Applications, 2024, 241: 122788.
[16] Zhang H, Teng W, He X, et al. Lightweight prohibited items detection model in X-ray images based on improved YOLOv7-tiny[J]. Journal of the Franklin Institute, 2025, 362(1): 107421.
[17] Guan F, Zhang H, Wang X. An improved YOLOv8 model for prohibited item detection with deformable convolution and dynamic head[J]. Journal of Real-Time Image Processing, 2025, 22(2): 84.
[18] Zhao C, Zhu L, Dou S, et al. Detecting overlapped objects in X-ray security imagery by a label-aware mechanism[J]. IEEE transactions on information forensics and security, 2022, 17: 998-1009.
[19] Ding J, Ye C, Wang H, et al. Foreign bodies detector based on detr for high-resolution x-ray images of textiles[J]. IEEE Transactions on Instrumentation and Measurement, 2023, 72: 1-10.
[20] Zhou Y, Xu X, Wang R. EI-YOLO: Efficiently Improved YOLO on Detection of Prohibited Items During Security Inspections[C]//Chinese Conference on Pattern Recognition and Computer Vision (PRCV). Singapore: Springer Nature Singapore, 2024: 330-343.
[21] Zhou Y T, Cao K Y, Li D, et al. Fine-YOLO: a simplified X-ray prohibited object detection network based on feature aggregation and normalized Wasserstein distance[J]. Sensors, 2024, 24(11): 3588.
[22] Jia L, Wang T, Chen Y, et al. MobileNet-CA-YOLO: An improved YOLOv7 based on the MobileNetV3 and attention mechanism for Rice pests and diseases detection[J]. Agriculture, 2023, 13(7): 1285.
[23] Zhong J, Chen J, Mian A. DualConv: Dual convolutional kernels for lightweight deep neural networks[J]. IEEE Transactions on Neural Networks and Learning Systems, 2022, 34(11): 9528-9535.
[24] Srivastava H, Sarawadekar K. A depthwise separable convolution architecture for CNN accelerator[C]//2020 IEEE Applied Signal Processing Conference (ASPCON). IEEE, 2020: 1-5.
[25] Miao C, Xie L, Wan F, et al. Sixray: A large-scale security inspection x-ray benchmark for prohibited item discovery in overlapping images[C]//Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2019: 2119-2128.
[26] Tao R, Wei Y, Jiang X, et al. Towards real-world X-ray security inspection: A high-quality benchmark and lateral inhibition module for prohibited items detection[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2021: 10923-10932.
[27] He K, Gkioxari G, Dollár P, et al. Mask r-cnn[C]//Proceedings of the IEEE international conference on computer vision. 2017: 2961-2969.
[28] X. Lu, B. Li, Y. Yue, Q. Li, J. Yan, Grid R-CNN, in: Proc. IEEE Conf. Comput. Vis.Pattern Recog, IEEE, Long Beach, CA, USA, 2019, pp. 7355–7364, https://doi.org/10.1109/CVPR.2019.00754.
[29] Ma C, Zhuo L, Li J, et al. Occluded prohibited object detection in X-ray images with global context-aware multi-scale feature aggregation[J]. Neurocomputing, 2023, 519: 1-16.
[30] Peng J, Lv K, Wang G, et al. MLSA-YOLO: a multi-level feature fusion and scale-adaptive framework for small object detection[J]. The Journal of Supercomputing, 2025, 81(4): 528.
[31] Wang A, Chen H, Liu L, et al. Yolov10: Real-time end-to-end object detection[J]. Advances in Neural Information Processing Systems, 2024, 37: 107984-108011.
[32] Khanam R, Hussain M. Yolov11: An overview of the key architectural enhancements[J]. arXiv preprint arXiv:2410.17725, 2024.
[33] Tian Y, Ye Q, Doermann D. Yolov12: Attention-centric real-time object detectors[J]. arXiv preprint arXiv:2502.12524, 2025.
[34] Huang S, Lu Z, Cun X, et al. Deim: Detr with improved matching for fast convergence[C]//Proceedings of the Computer Vision and Pattern Recognition Conference. 2025: 15162-15171.
Downloads
Published
Issue
Section
Categories
License
Copyright (c) 2025 Zilong Xue, Bo Wang, Yuanwei Xie, Zhibin Li, Xiaozheng Fan, Chenyoukang Lin, Peiyang Wei, Linlin Chen, Xun Deng, Jianhong Gan

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.