Figure2
![Deep reinforcement learning for real-world quadrupedal locomotion: a comprehensive review](https://image.oaes.cc/12bbd333-926b-4682-b8bc-e9945c452687/5115-2.jpg)
Figure 2. A common paradigm for DRL-based quadrupedal locomotion research. This paradigm is mainly divided into training and testing phases. The policy interacts with the simulated environment and collects data for iterative updates, and then the trained policy is deployed to the real robot.