![Stability-preserving automatic tuning of PID control with reinforcement learning](https://image.oaes.cc/4d3e2ccf-effe-4fc3-8d8e-0b3381829243/4601-3.jpg)
Figure 3. The structure of the actor and critic networks. Left: the actor network, where layer normalization is applied before each layer. Decaying noise is added to the output to encourage exploration early in RL training. Right: the critic network, which consumes the state and action and returns the estimated Q-value.
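The architecture in Figure 3 can be sketched as follows. This is a minimal illustration, not the paper's implementation: the layer widths, weight initialization, noise scale, and decay rate are all placeholder assumptions, and the learned gain/bias of layer normalization is omitted for brevity.

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Normalize features to zero mean / unit variance
    # (learned gain and bias omitted in this sketch).
    return (x - x.mean(-1, keepdims=True)) / (x.std(-1, keepdims=True) + eps)

class Actor:
    """Actor: state -> action, with layer norm before each layer
    and decaying Gaussian exploration noise on the output."""
    def __init__(self, state_dim=4, action_dim=3, hidden=32, seed=0):
        rng = np.random.default_rng(seed)
        # Hypothetical widths and init scale; not from the paper.
        self.W1 = rng.normal(0.0, 0.1, (state_dim, hidden))
        self.W2 = rng.normal(0.0, 0.1, (hidden, action_dim))
        self.noise_scale = 0.5      # assumed initial exploration level
        self.noise_decay = 0.999    # assumed per-step decay factor
        self._rng = rng

    def act(self, state, explore=True):
        h = np.tanh(layer_norm(state) @ self.W1)
        a = np.tanh(layer_norm(h) @ self.W2)
        if explore:
            # Noise shrinks over training, so exploration fades out.
            a = a + self._rng.normal(0.0, self.noise_scale, a.shape)
            self.noise_scale *= self.noise_decay
        return a

class Critic:
    """Critic: (state, action) -> scalar Q-value estimate."""
    def __init__(self, state_dim=4, action_dim=3, hidden=32, seed=1):
        rng = np.random.default_rng(seed)
        self.W1 = rng.normal(0.0, 0.1, (state_dim + action_dim, hidden))
        self.w2 = rng.normal(0.0, 0.1, (hidden,))

    def q_value(self, state, action):
        x = np.concatenate([state, action], axis=-1)
        h = np.tanh(layer_norm(x) @ self.W1)
        return float(h @ self.w2)   # scalar Q(s, a)
```

A training loop would update the critic toward a bootstrapped target and the actor along the critic's gradient, as in standard actor-critic methods; only the network structure from the figure is shown here.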