FDD-YOLO: A Lightweight Multi-scale Prohibited Items Detection Model

Zilong Xue; Bo Wang; Yuanwei Xie; Zhibin Li; Xiaozheng Fan; Chenyoukang Lin; Peiyang Wei; Linlin Chen; Xun Deng; Jianhong Gan

doi:10.4108/airo.10277

Authors

Zilong Xue Xinjiang University
Bo Wang Xinjiang University
Yuanwei Xie Xinjiang University
Zhibin Li Chengdu University of Information Technology, Sichuan University of Arts and Science
Xiaozheng Fan Xinjiang University
Chenyoukang Lin Xinjiang University
Peiyang Wei Chengdu University of Information Technology, Sichuan University of Arts and Science
Linlin Chen Chengdu University of Information Technology, Sichuan University of Arts and Science
Xun Deng Chengdu University of Information Technology, Sichuan University of Arts and Science
Jianhong Gan Chengdu University of Information Technology, Sichuan University of Arts and Science

DOI:

https://doi.org/10.4108/airo.10277

Keywords:

Frequency Domain Decomposition Network (FDDN), Deformable Elastic Fusion Pyramid (DEFP), Dual-channel Convolution (DualConv), Prohibited Items, X-ray Image

Abstract

X-ray security inspection must detect prohibited items accurately and in real time despite severe occlusion, scale variation, and complex backgrounds. Although YOLO-series models offer high inference efficiency, they suffer from feature redundancy, insufficient fine-grained feature extraction, and limited adaptability to overlapping objects. To overcome these limitations, we propose FDD-YOLO, which introduces three novel modules: (1) the Frequency Domain Decomposition Network (FDDN) in the backbone decomposes features into high-frequency and low-frequency components, enhancing the edges of metal objects and the contours of liquid containers while reducing computational redundancy; (2) the Deformable Elastic Fusion Pyramid (DEFP) in the neck uses dynamic channel allocation and multi-scale deformable convolution to handle the geometric changes of folded and overlapping objects; and (3) the lightweight Dual-channel Convolution (DualConv) improves multi-scale feature capture through group and pointwise convolution, reducing the number of parameters while improving small-object detection accuracy. On the SIXray, HIXray, and private GIX datasets, FDD-YOLO achieves 2.6%, 3.2%, and 8.6% higher mAP than YOLOv11n, reaching 94.8%, 84%, and 71.8%, respectively. The framework also reduces the number of parameters by 30.6% and FLOPs by 26.9%, balancing accuracy and efficiency and setting a new technical benchmark for real-time security inspection.
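To illustrate why pairing a group convolution with a pointwise convolution can shrink a layer, the following minimal sketch counts the weights of a single layer. It assumes a layout in which a k×k group convolution runs in parallel with a 1×1 pointwise convolution and their outputs are summed. The abstract does not specify the exact layout used in FDD-YOLO, so the function names, channel sizes, and group count here are hypothetical.

```python
def conv_params(c_in, c_out, k, groups=1):
    """Weight count of a 2-D convolution without bias."""
    if c_in % groups or c_out % groups:
        raise ValueError("channel counts must be divisible by groups")
    # Each output channel only sees c_in / groups input channels.
    return (c_in // groups) * c_out * k * k


def dualconv_params(c_in, c_out, k=3, groups=4):
    """Weight count of an ASSUMED dual-convolution layer:
    a k x k group convolution in parallel with a 1 x 1 pointwise
    convolution over all input channels."""
    group_branch = conv_params(c_in, c_out, k, groups)
    pointwise_branch = conv_params(c_in, c_out, 1)
    return group_branch + pointwise_branch


# Hypothetical layer sizes, chosen only for illustration.
standard = conv_params(64, 128, 3)
dual = dualconv_params(64, 128, k=3, groups=4)
saving = 1 - dual / standard

print(f"standard 3x3 conv: {standard} weights")
print(f"dual conv (G=4):   {dual} weights")
print(f"layer-level saving: {saving:.1%}")
```

This is a per-layer estimate only. The 30.6% parameter reduction reported in the abstract is a whole-model figure and cannot be derived from this sketch.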


