---
tags:
- LunarLander-v2
- ppo
- deep-reinforcement-learning
- reinforcement-learning
- custom-implementation
- deep-rl-course
model-index:
- name: PPO
  results:
  - task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: LunarLander-v2
      type: LunarLander-v2
    metrics:
    - type: mean_reward
      value: -226.85 +/- 113.36
      name: mean_reward
      verified: false
---
# PPO Agent Playing LunarLander-v2

This is a trained model of a PPO agent playing LunarLander-v2.
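
The `mean_reward` value in the metadata has the form `mean +/- std`. This card does not state how the figure was produced, so the following is only a minimal sketch. It assumes the metric is the mean and population standard deviation of per-episode returns over a set of evaluation episodes. The function name `summarize_returns` and the sample returns are hypothetical and are not this model's data.

```python
import statistics


def summarize_returns(episode_returns):
    """Format per-episode returns as 'mean +/- std' (assumed metric shape)."""
    mean = statistics.fmean(episode_returns)
    # Population standard deviation; an assumption about how the spread is reported.
    std = statistics.pstdev(episode_returns)
    return f"{mean:.2f} +/- {std:.2f}"


# Hypothetical returns from four evaluation episodes (illustrative only).
example_returns = [-100.0, -200.0, -300.0, -400.0]
print(summarize_returns(example_returns))  # -250.00 +/- 111.80
```

In practice the list of returns would come from running the trained policy in the environment for a fixed number of episodes and summing the rewards of each episode.
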
# Hyperparameters