Object Detection and Segmentation of Power Equipment in Infrared Images via Improved YOLOv8 and Prompt-Optimized SAM
DOI:
https://doi.org/10.4108/eetsis.10727Keywords:
Infrared Images, Power Equipment, Object Detection, Image Segmentation, YOLOv8Abstract
To achieve automated infrared monitoring of power equipment in substations, this paper proposes an object detection and segmentation method based on improved YOLOv8 and Prompt-Optimized SAM (Segment Anything Model). Firstly, to address the issues of poor resolution and strong background interference in infrared images, the small object feature extraction capability and bounding box regression accuracy of YOLOv8 are improved by introducing a multi-scale feature extraction module, a robust feature downsampling module, and an improved loss function. The Spatial Pyramid Pooling Fast module is improved using large-kernel depthwise separable convolution, enhancing the extraction capability for both global and local features. Secondly, to improve segmentation accuracy, this paper proposes a method that converts detection boxes into prompt points. GrabCut, combined with colour saliency and a superpixel algorithm, is used to segment high-confidence target regions. Zero-shot prompt point segmentation for SAM is achieved by performing clustering on the regions. Experimental validation on an infrared dataset covering seven types of power equipment shows that the improved object detection model achieves an mAP@0.5 of 95.7%, which is 2.3% higher than the original model, with a detection speed of 107.5 FPS. The proposed segmentation method achieves higher accuracy in complex backgrounds than both bounding box-prompted SAM and GrabCut. This study lays a foundation for the precise processing of infrared images of substation power equipment.
References
[1] Usamentiaga R, Fernandez MA, Villan AF, Carus JL. Temperature Monitoring for Electrical Substations Using Infrared Thermography: Architecture for Industrial Internet of Things. IEEE Trans Ind Inform. 2018;14(12):5667–5677.
[2] Liu ZQ, Fu H, Li YJ, Zhang GJ, Hu CB, Zhang ZH. Infrared Image Power Equipment Detection Based on Mask-RCNN Transfer Learning. J Data Acquis Process. 2021;36(1):176–183.
[3] Liu X, Zhang Z, Hao Y, Zhao H, Yang Y. Optimized OTSU Segmentation Algorithm-Based Temperature Feature Extraction Method for Infrared Images of Electrical Equipment. Sensors. 2024;24(4):1126.
[4] Jadin MS, Taib S. Recent Progress in Diagnosing the Reliability of Electrical Equipment by Using Infrared Thermography. Infrared Phys Technol. 2012;55(3):236–245.
[5] Ou JH, Wang JG, Xue J, Wang JP, Zhou X, She LG, Fan YD. Infrared Image Target Detection of Substation Electrical Equipment Using an Improved Faster R-CNN. IEEE Trans Power Del. 2023;38(1):387–396.
[6] Zhang L, Kuang J, Teng Y, Xiang S, Li L, Zhou Y. A Lightweight Infrared and Visible Light Multimodal Fusion Method for Object Detection in Power Inspection. Processes. 2025;13(9):2720.
[7] Xu C, Li Q, Jiang X, Yu D, Zhou Y. Dual-Space Graph-Based Interaction Network for RGB-Thermal Semantic Segmentation in Electric Power Scene. IEEE Trans Circuits Syst Video Technol. 2023;33(4):1577–1592.
[8] Wang B, Dong M, Ren M, Wu ZY, Guo CX, Zhuang TX, Pischler O, Xie JC. Automatic Fault Diagnosis of Infrared Insulator Images Based on Image Instance Segmentation and Temperature Analysis. IEEE Trans Instrum Meas. 2020;69(8):5345–5355.
[9] Kirillov A, Mintun E, Ravi N, Mao H, Rolland C, Gustafson L, Xiao T, Whitehead S, Berg AC, Lo WY, Dollár P, Girshick R. Segment Anything. In: Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV); 2023 October 1-6; Paris, France. Piscataway, NJ: IEEE; 2023. p. 3992–4003.
[10] Ultralytics LLC. YOLOv8 Documentation Release 8.0. San Francisco, CA, USA: Ultralytics LLC; 2023.
[11] Wu T, Dong Y. YOLO-SE: Improved YOLOv8 for Remote Sensing Object Detection and Recognition. Appl Sci. 2023;13(24):12977.
[12] Ma M, Pang H. SP-YOLOv8s: An Improved YOLOv8s Model for Remote Sensing Image Tiny Object Detection. Appl Sci. 2023;13(14):8161.
[13] Wang G, Chen Y, An P, Hong H, Hu J, Huang T. UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios. Sensors. 2023;23(16):7190.
[14] Rother C, Kolmogorov V, Blake A. GrabCut: Interactive Foreground Extraction Using Iterated Graph Cuts. ACM Trans Graph. 2004;23(3):309–314.
[15] Dan B, Li M, Tang T, Zhang J. One Shot Is Enough for Sequential Infrared Small Target Segmentation. In: Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2025 April 6-11; Hyderabad, India. Piscataway, NJ: IEEE; 2025. p. 1–5.
[16] Li Y, Wang D, Yuan C, Li H, Hu J. Enhancing Agricultural Image Segmentation with an Agricultural Segment Anything Model Adapter. Sensors. 2023;23(18):7884.
[17] Ma X, Li Y. Edge-Aided Multiscale Context Network for Infrared Small Target Detection. IEEE Geosci Remote Sens Lett. 2023;20(1):1–5.
[18] Wang T, Zhang J, Ren B, Liu B. MMW-YOLOv5: A Multi-Scale Enhanced Traffic Sign Detection Algorithm. IEEE Access. 2024;12(1):148880–148892.
[19] Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Süsstrunk S. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods. IEEE Trans Pattern Anal Mach Intell. 2012;34(11):2274–2282.
[20] Hartigan JA, Wong MA. Algorithm AS 136: A K-Means Clustering Algorithm. J R Stat Soc Ser C (Appl Stat). 1979;28(1):100–108.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 Bing Xue, Zehui Liu, Zhanhong Wang, Wenyuan Zhou, Baoning Wang, Xukun Yang

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
This is an open access article distributed under the terms of the CC BY-NC-SA 4.0, which permits copying, redistributing, remixing, transformation, and building upon the material in any medium so long as the original work is properly cited.
