PPO agent for the lunar lander environment as part of the hugging face reinforcement learning course.
80da88a
- OS: Linux-5.15.109+-x86_64-with-glibc2.35 # 1 SMP Fri Jun 9 10:57:30 UTC 2023 | |
- Python: 3.10.12 | |
- Stable-Baselines3: 2.0.0a5 | |
- PyTorch: 2.0.1+cu118 | |
- GPU Enabled: True | |
- Numpy: 1.23.5 | |
- Cloudpickle: 2.2.1 | |
- Gymnasium: 0.28.1 | |
- OpenAI Gym: 0.25.2 | |