Figure 9. Average reward curve during the training process.
All published articles are preserved here permanently: