| library_name: stable-baselines3 | |
| tags: | |
| - LunarLander-v2 | |
| - deep-reinforcement-learning | |
| - reinforcement-learning | |
| - stable-baselines3 | |
| model-index: | |
| - name: PPO_v1 | |
| results: | |
| - metrics: | |
| - type: mean_reward | |
| value: 226.29 +/- 14.66 | |
| name: mean_reward | |
| task: | |
| type: reinforcement-learning | |
| name: reinforcement-learning | |
| dataset: | |
| name: LunarLander-v2 | |
| type: LunarLander-v2 | |
| # **PPO_v1** Agent playing **LunarLander-v2** | |
| This is a trained model of a **PPO_v1** agent playing **LunarLander-v2** using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3). | |
| ## Usage (with Stable-baselines3) | |
| TODO: Add your code | |