DeepRLCourse2022 / bguan_ppo_lunarlander /_stable_baselines3_version

Commit History

bguan's lunar lander model using PPO trained for 500K timesteps
807c5ec

bguan commited on