Figure6
![Reinforcement learning with parameterized action space and sparse reward for UAV navigation](https://image.oaes.cc/5a85a66c-d57d-4c7d-81a0-cd7a69f1fdbe/5735-6.jpg)
Figure 6. Comparisons of the results using HER-MPDQN and other baselines. The left figure shows the periodic calculation of the task completion rate of the last 100 episodes; other descriptions for evaluating algorithm performance are the same as Figure 5.