Figure 7
From: A distributed multi-vehicle pursuit scheme: generative multi-adversarial reinforcement learning

Figure 7. Convergence Process During Training. (A) Convergence of the Average Reward for Each Method in P4-E2. (B) Convergence of the Average Reward for Each Method in P5-E3. (C) Convergence of the Average Reward for Each Method in P7-E4.