Voltage Fluctuation Control Strategy Based on Reinforcement Learning Digital Twin Model for Wind-Solar-Water-Storage Smart Microgrid Energy Storage Converter Side

Authors

  • Erbao Yan Guangzhou Power Supply Bureau of Guangdong Power Grid Co., Ltd.

DOI:

https://doi.org/10.4108/ew.12110

Keywords:

New energy, Microgrid, Voltage fluctuation control, Deep reinforcement learning, Proximal policy optimization

Abstract

With the increasing penetration of distributed renewable energy (DRE), the stable operation of smart microgrids integrating wind, solar, hydro, and storage faces significant challenges. The core issue lies in the strong intermittency and random-ness of distributed generation outputs, which can easily trigger voltage fluctuations, particularly at the grid connection points of Power Conversion Systems (PCS), seriously threatening power quality and supply reliability. This study innovatively integrates Deep Reinforcement Learning and Digital Twin (DRL+DT) technologies to establish a novel control framework, which constructs a digital twin of the microgrid. Based on the Proximal Policy Optimization (PPO) algorithm, a DRL controller is designed, with clearly defined state spaces, action spaces, and a composite reward function incorporating multi-objective constraints. This modeling approach transforms the voltage control problem into a sequential decision-making task solvable through data-driven methods. The trained policy neural network serves as an intelligent controller for performance comparison in subsequent sections. To validate the proposed approach, simulation tests verify that the proposed method suppresses voltage fluctuations more rapidly and smoothly compared to conventional PI control and single DRL approaches. The voltage deviation is reduced by 62.4%, while the State of Charge (SOC) of the energy storage system is effectively maintained within a healthy range. The experimental results not only verify the technical advantages of the combined DRL+DT framework in addressing complex energy control challenges in microgrids but also demonstrate its capability to support the development of highly resilient and self-healing smart microgrid control systems, highlighting both advanced functionality and practical utility.

Downloads

Download data is not yet available.

References

[1] Evangeline S I, Baskaran K, Darwin S. Minimizing voltage fluctuation in stand-alone microgrid system using a Kriging-based multi-objective stochastic optimization algorithm [J]. Electrical Engineering, 2024, 106(6). DOI:10.1007/s00202-024-02497-3.

[2] Li D, Ren L, Liu F, et al. Two-time scale microgrid scheduling based on power fluctuation mitigation priority and model predictive control [J]. Energy, 2025, 324. DOI:10.1016/j.energy.2025.135760.

[3] Nandi R, Tripathy M, Gupta C P. Advanced Adaptive Virtual Impedance Based Dual Mode Inverter Controller for Power and Voltage Coordination in LV AC Microgrid [J]. IEEE Transactions on Industry Applications, 2024 (6 Pt.1): 60. DOI:10.1109/TIA.2024.3443777.

[4] Shayeghi H, Rahnama A, Bizon N. TFODn-FOPI multi-stage controller design to maintain an islanded microgrid load-frequency balance considering responsive loads support [J]. IET generation, transmission & distribution, 2023.DOI:10.1049/gtd2.12898.

[5] Liu W, Pei J, Ye Y, et al. Prescribed-performance-based adaptive fractional-order sliding mode control for ship DC microgrid [J]. Ocean Engineering, 2024, 311(Part2): 8. DOI:10.1016/j.oceaneng.2024.118885.

[6] Farrokhi E, Ghoreishy H, Ahmadiahangar R .Real-time optimal power management for a hybrid energy storage system with battery thermal consideration and DC microgrid current estimation capability [J]. Electrical Engineering, 2024, 106(4). DOI:10.1007/s00202-024-02243-9.

[7] Shi K, Yu Z, Xu P, et al. A novel adaptive control strategy for DC microgrids with additional virtual inertia [J]. Electrical Engineering, 2025, 107(8):10777-10788. DOI:10.1007/s00202-025-03060-4.

[8] Kumari P , Pan S .Enhanced load frequency control using predictive reduced order generalized active disturbance rejection control under communication delay and cyber-attack[J].Electrical Engineering, 2025, 107(3).DOI:10.1007/s00202-024-02713-0.

[9] Govindasamy S , Balapattabi S R , Kaliappan B ,et al.Energy management in microgrids using IoT considering uncertainties of renewable energy sources and electric demands: GBDT-JS approach[J].Electrical engineering, 2023(6):105.DOI:10.1007/s00202-023-01947-8.

[10] Guo J , Zhang S , Wang S .Research on nonlinear robust control strategy for active support of grid-forming photovoltaic frequency[J].The European Physical Journal Plus, 2025, 140(5):1-16.DOI:10.1140/epjp/s13360-025-06316-x.

[11] Sasidhar P R S , Gebremedhin A , Norheim I .Multi-energy microgrid design and the role of coupling components—A review[J].Renewable and Sustainable Energy Reviews, 2025, 216.DOI:10.1016/j.rser.2025.115540.

[12] Luo D , Li Z , Yan Y ,et al.Performance analysis and optimization of an annular thermoelectric generator integrated with vapor chambers[J].Energy, 2024, 307. DOI:10.1016/j.energy.2024.132565.

[13] Sahil M , Prasenjit B .Determination of microgrid stability index based on measured electrical parameters and Mamdani fuzzy inference system[J].Electrical engineering, 2024, 106(1):581-601.DOI:10.1007/s00202-023-02002-2.

[14] Chen F , Zhao Z , He X ,et al.Quantification of abnormal characteristics and flow-patterns identification in pumped storage system[J].Nonlinear Dynamics, 2024, 112(23):20813-20848. DOI:10.1007/s11071-024-10131-x.

[15] Parisio A , Rikos E , Glielmo L .A Model Predictive Control Approach to Microgrid Operation Optimization[J].IEEE Transactions on Control Systems Technology, 2014, 22(5):1813-1827. DOI:10.1109/TCST.2013.2295737.

[16] Muduli R , Jena D , Moger T .Automatic generation control of is-landed micro-grid using integral reinforcement learning-based adaptive optimal control strategy[J].Electrical Engineering, 2025, 107(3). DOI:10.1007/s00202-024-02648-6.

[17] Bhuyan M , Das D C , Barik A K . Chaotic butterfly optimization algorithm based cascaded PI‐TID controller for frequency control in three area hybrid microgrid system[J].Optimal Control - Applications & Methods, 2023, 44(5). DOI:10.1002/oca.2994.

[18] Casagrande V, Ferianc M, Rodrigues M, et al. Learning-based MPC with uncertainty estimation for resilient microgrid energy management[J].IFAC PapersOnLine, 2024, 58(4):556-561. DOI:10.1016/j.ifacol.2024.07.277.

[19] Abdulwahid A H , Wang S .A new differential protection scheme for microgrid using Hilbert space based power setting and fuzzy decision processes[C]//Industrial Electronics & Applications.IEEE, 2016:6-11.DOI:10.1109/ICIEA.2016.7603542.

[20] [20] Li S , Qiu Y , Huangfu L Y .Net Power Optimization Based on Extremum Search and Model-Free Adaptive Control of PEMFC Power Generation System for High Altitude[J].IEEE transactions on transportation electrification, 2023, 9(4):5151-5164.

[21] Momen H , Jadid S .A novel microgrid formation strategy for resilience enhancement considering energy storage systems based on deep reinforcement learning[J].Journal of Energy Storage, 2024, 100.DOI:10.1016/j.est.2024.113565.

[22] Aboras K M , Alshehri M H , Megahed A I .Eel and Grouper Optimization-Based Fuzzy FOPI-TIDμ-PIDA Controller for Frequency Management of Smart Microgrids Under the Impact of Communication Delays and Cyberattacks [J]. Mathematics (2227-7390), 2025, 13(13). DOI:10.3390/math13132040.

[23] Zheng Y , Wang Y , Meng X ,et al.Distributed Economic MPC for Synergetic Regul-ation of the Voltage of an Island DC Micro-Grid[J].Acta Automatica Sinica, 2024(3). DOI:10.1109/JAS.2023.123750.

[24] Wang C , Zhang J , Wang A ,et al.Prioritized sum-tree experience replay TD3 DRL-based online energy management of a residential microgrid[J].Applied Energy, 2024, 368.DOI:10.1016/j.apenergy.2024.123471.

[25] Musarrat M N , Fekih A ,Md. Ashib RahmanMd. Rabiul IslamKashem M. Muttaqi.An Event Triggered Sliding Mode Control-Based Fault Ride-Through Scheme to Improve the Transient Stability of Wind Energy Systems[J].IEEE Transactions on Industry Applications, 2024, 60(1 Pt.1):876-886.DOI:10.1109/TIA.2023.3328851.

[26] Cagnano A .Can Integrating SoC Management in Economic Dispatch Enhance Real-Time Operation of a Microgrid?[J].Energies (19961073), 2025, 18(7).DOI:10.3390/en18071802.

[27] Chen C , Xia R , Hu H .A novel multi-classification method for photovoltaic electricity theft behavior with low false detection rate[J].Measurement, 2025, 239(000):115461.DOI:10.1016/j.measurement.2024.115461.

[28] Wais D S , Majeed W S , Mahmood W K M .Microgrid Utilizing Solar Power System with Pulse Width Modulation[J].IOP Publishing Ltd, 2025.DOI:10.1088/1755-1315/1507/1/012011.

[29] Fadoul F F , Hassan A A , Alar R .Integrating autoencoder and decision tree models for enhanced energy consumption forecasting in microgrids: A meteorological data-driven approach in Djibouti[J].Results in Engineering, 2024, 24.DOI:10.1016/j.rineng.2024.103033.

[30] Srivastava D , Narayanan V , Singh B ,et al.A Novel Steepest Descent Least Mean Square Control for Smooth Mode Transfer of a Single-Stage SPVA-BES Hybrid Microgrid [J]. IEEE Transactions on Industry Applications. 2024, 60(5-Part1): 14. DOI:10.1109/TIA.2024.3412868.

Downloads

Published

15-04-2026

How to Cite

1.
Yan E. Voltage Fluctuation Control Strategy Based on Reinforcement Learning Digital Twin Model for Wind-Solar-Water-Storage Smart Microgrid Energy Storage Converter Side. EAI Endorsed Trans Energy Web [Internet]. 2026 Apr. 15 [cited 2026 Apr. 15];12. Available from: https://publications.eai.eu/index.php/ew/article/view/12110