Wind turbine pitch reinforcement learning control improved by PID regulator and learning observer

Sierra-Garcia, J. Enrique and Santos, Matilde and Pandit, Ravi (2022) Wind turbine pitch reinforcement learning control improved by PID regulator and learning observer. Engineering Applications of Artificial Intelligence, 111. p. 104769. ISSN 0952-1976

[img]
Preview
Text
Published Version
Available under the following license: Creative Commons Attribution Non-commercial No Derivatives.

Download (2MB) | Preview
Official URL: https://doi.org/10.1016/j.engappai.2022.104769

Abstract

Wind turbine (WT) pitch control is a challenging issue due to the non-linearities of the wind device and its complex dynamics, the coupling of the variables and the uncertainty of the environment. Reinforcement learning (RL) based control arises as a promising technique to address these problems. However, its applicability is still limited due to the slowness of the learning process. To help alleviate this drawback, in this work we present a hybrid RL-based control that combines a RL-based controller with a proportional–integral–derivative (PID) regulator, and a learning observer. The PID is beneficial during the first training episodes as the RL based control does not have any experience to learn from. The learning observer oversees the learning process by adjusting the exploration rate and the exploration window in order to reduce the oscillations during the training and improve convergence. Simulation experiments on a small real WT show how the learning significantly improves with this control architecture, speeding up the learning convergence up to 37%, and increasing the efficiency of the intelligent control strategy. The best hybrid controller reduces the error of the output power by around 41% regarding a PID regulator. Moreover, the proposed intelligent hybrid control configuration has proved more efficient than a fuzzy controller and a neuro-control strategy.

Item Type: Journal Article
Keywords: Intelligent control, Reinforcement learning, Learning observer, Pitch control, Wind turbines
Faculty: Faculty of Science & Engineering
Depositing User: Lisa Blanshard
Date Deposited: 16 Mar 2022 16:59
Last Modified: 31 May 2022 16:18
URI: https://arro.anglia.ac.uk/id/eprint/707407

Actions (login required)

Edit Item Edit Item