Object Detection and Segmentation of Power Equipment in Infrared Images via Improved YOLOv8 and Prompt-Optimized SAM

Authors

  • Bing Xue State Grid Shaanxi Electric Power Company Limited Weinan Power Supply Company https://orcid.org/0009-0002-7106-9454
  • Zehui Liu State Grid Shaanxi Electric Power Company Limited Weinan Power Supply Company
  • Zhanhong Wang State Grid Shaanxi Electric Power Company Limited Weinan Power Supply Company
  • Wenyuan Zhou State Grid Shaanxi Electric Power Company Limited Weinan Power Supply Company
  • Baoning Wang State Grid Shaanxi Electric Power Company Limited Weinan Power Supply Company
  • Xukun Yang State Grid Shaanxi Electric Power Company Limited Weinan Power Supply Company

DOI:

https://doi.org/10.4108/eetsis.10727

Keywords:

Infrared Images, Power Equipment, Object Detection, Image Segmentation, YOLOv8

Abstract

To achieve automated infrared monitoring of power equipment in substations, this paper proposes an object detection and segmentation method based on improved YOLOv8 and Prompt-Optimized SAM (Segment Anything Model). Firstly, to address the issues of poor resolution and strong background interference in infrared images, the small object feature extraction capability and bounding box regression accuracy of YOLOv8 are improved by introducing a multi-scale feature extraction module, a robust feature downsampling module, and an improved loss function. The Spatial Pyramid Pooling Fast module is improved using large-kernel depthwise separable convolution, enhancing the extraction capability for both global and local features. Secondly, to improve segmentation accuracy, this paper proposes a method that converts detection boxes into prompt points. GrabCut, combined with colour saliency and a superpixel algorithm, is used to segment high-confidence target regions. Zero-shot prompt point segmentation for SAM is achieved by performing clustering on the regions. Experimental validation on an infrared dataset covering seven types of power equipment shows that the improved object detection model achieves an mAP@0.5 of 95.7%, which is 2.3% higher than the original model, with a detection speed of 107.5 FPS. The proposed segmentation method achieves higher accuracy in complex backgrounds than both bounding box-prompted SAM and GrabCut. This study lays a foundation for the precise processing of infrared images of substation power equipment.

References

[1] Usamentiaga R, Fernandez MA, Villan AF, Carus JL. Temperature Monitoring for Electrical Substations Using Infrared Thermography: Architecture for Industrial Internet of Things. IEEE Trans Ind Inform. 2018;14(12):5667–5677.

[2] Liu ZQ, Fu H, Li YJ, Zhang GJ, Hu CB, Zhang ZH. Infrared Image Power Equipment Detection Based on Mask-RCNN Transfer Learning. J Data Acquis Process. 2021;36(1):176–183.

[3] Liu X, Zhang Z, Hao Y, Zhao H, Yang Y. Optimized OTSU Segmentation Algorithm-Based Temperature Feature Extraction Method for Infrared Images of Electrical Equipment. Sensors. 2024;24(4):1126.

[4] Jadin MS, Taib S. Recent Progress in Diagnosing the Reliability of Electrical Equipment by Using Infrared Thermography. Infrared Phys Technol. 2012;55(3):236–245.

[5] Ou JH, Wang JG, Xue J, Wang JP, Zhou X, She LG, Fan YD. Infrared Image Target Detection of Substation Electrical Equipment Using an Improved Faster R-CNN. IEEE Trans Power Del. 2023;38(1):387–396.

[6] Zhang L, Kuang J, Teng Y, Xiang S, Li L, Zhou Y. A Lightweight Infrared and Visible Light Multimodal Fusion Method for Object Detection in Power Inspection. Processes. 2025;13(9):2720.

[7] Xu C, Li Q, Jiang X, Yu D, Zhou Y. Dual-Space Graph-Based Interaction Network for RGB-Thermal Semantic Segmentation in Electric Power Scene. IEEE Trans Circuits Syst Video Technol. 2023;33(4):1577–1592.

[8] Wang B, Dong M, Ren M, Wu ZY, Guo CX, Zhuang TX, Pischler O, Xie JC. Automatic Fault Diagnosis of Infrared Insulator Images Based on Image Instance Segmentation and Temperature Analysis. IEEE Trans Instrum Meas. 2020;69(8):5345–5355.

[9] Kirillov A, Mintun E, Ravi N, Mao H, Rolland C, Gustafson L, Xiao T, Whitehead S, Berg AC, Lo WY, Dollár P, Girshick R. Segment Anything. In: Proceedings of the 2023 IEEE/CVF International Conference on Computer Vision (ICCV); 2023 October 1-6; Paris, France. Piscataway, NJ: IEEE; 2023. p. 3992–4003.

[10] Ultralytics LLC. YOLOv8 Documentation Release 8.0. San Francisco, CA, USA: Ultralytics LLC; 2023.

[11] Wu T, Dong Y. YOLO-SE: Improved YOLOv8 for Remote Sensing Object Detection and Recognition. Appl Sci. 2023;13(24):12977.

[12] Ma M, Pang H. SP-YOLOv8s: An Improved YOLOv8s Model for Remote Sensing Image Tiny Object Detection. Appl Sci. 2023;13(14):8161.

[13] Wang G, Chen Y, An P, Hong H, Hu J, Huang T. UAV-YOLOv8: A Small-Object-Detection Model Based on Improved YOLOv8 for UAV Aerial Photography Scenarios. Sensors. 2023;23(16):7190.

[14] Rother C, Kolmogorov V, Blake A. GrabCut: Interactive Foreground Extraction Using Iterated Graph Cuts. ACM Trans Graph. 2004;23(3):309–314.

[15] Dan B, Li M, Tang T, Zhang J. One Shot Is Enough for Sequential Infrared Small Target Segmentation. In: Proceedings of the 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP); 2025 April 6-11; Hyderabad, India. Piscataway, NJ: IEEE; 2025. p. 1–5.

[16] Li Y, Wang D, Yuan C, Li H, Hu J. Enhancing Agricultural Image Segmentation with an Agricultural Segment Anything Model Adapter. Sensors. 2023;23(18):7884.

[17] Ma X, Li Y. Edge-Aided Multiscale Context Network for Infrared Small Target Detection. IEEE Geosci Remote Sens Lett. 2023;20(1):1–5.

[18] Wang T, Zhang J, Ren B, Liu B. MMW-YOLOv5: A Multi-Scale Enhanced Traffic Sign Detection Algorithm. IEEE Access. 2024;12(1):148880–148892.

[19] Achanta R, Shaji A, Smith K, Lucchi A, Fua P, Süsstrunk S. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods. IEEE Trans Pattern Anal Mach Intell. 2012;34(11):2274–2282.

[20] Hartigan JA, Wong MA. Algorithm AS 136: A K-Means Clustering Algorithm. J R Stat Soc Ser C (Appl Stat). 1979;28(1):100–108.

Downloads

Published

12-03-2026

Issue

Section

Scheduling optimization and load balancing in scalable distributed systems

How to Cite

1.
Xue B, Liu Z, Wang Z, Zhou W, Wang B, Yang X. Object Detection and Segmentation of Power Equipment in Infrared Images via Improved YOLOv8 and Prompt-Optimized SAM. EAI Endorsed Scal Inf Syst [Internet]. 2026 Mar. 12 [cited 2026 Mar. 14];12(7). Available from: https://publications.eai.eu/index.php/sis/article/view/10727