PPO agent for the lunar lander environment as part of the hugging face reinforcement learning course.

80da88a over 1 year ago

248 Bytes

	- OS: Linux-5.15.109+-x86_64-with-glibc2.35 # 1 SMP Fri Jun 9 10:57:30 UTC 2023
	- Python: 3.10.12
	- Stable-Baselines3: 2.0.0a5
	- PyTorch: 2.0.1+cu118
	- GPU Enabled: True
	- Numpy: 1.23.5
	- Cloudpickle: 2.2.1
	- Gymnasium: 0.28.1
	- OpenAI Gym: 0.25.2