---
library_name: sample-factory
tags:
- deep-reinforcement-learning
- reinforcement-learning
- sample-factory
model-index:
- name: APPO
  results:
  - metrics:
    - type: mean_reward
      value: 9350.13 +/- 1.31
      name: mean_reward
    task:
      type: reinforcement-learning
      name: reinforcement-learning
    dataset:
      name: mujoco_doublependulum
      type: mujoco_doublependulum
---

An **APPO** model trained on the **mujoco_doublependulum** environment.

This model was trained using Sample Factory 2.0: https://github.com/alex-petrenko/sample-factory
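
The `mean_reward` metric in this card's metadata is stored as a single string, `9350.13 +/- 1.31`, not as separate numbers. The sketch below shows one way to split such a string into two floats for comparison against other runs. It assumes the value follows a "mean +/- spread" layout. The helper name `parse_mean_reward` is hypothetical and is not part of Sample Factory.

```python
import re
from typing import Tuple


def parse_mean_reward(value: str) -> Tuple[float, float]:
    """Split a 'mean +/- spread' string into a (mean, spread) pair of floats.

    Hypothetical helper for illustration; assumes the layout used by the
    `mean_reward` value in this card's metadata.
    """
    pattern = r"\s*(-?\d+(?:\.\d+)?)\s*\+/-\s*(\d+(?:\.\d+)?)\s*"
    match = re.fullmatch(pattern, value)
    if match is None:
        raise ValueError(f"Unrecognized metric format: {value!r}")
    return float(match.group(1)), float(match.group(2))


# Value copied verbatim from the metadata of this card.
mean, spread = parse_mean_reward("9350.13 +/- 1.31")
print(f"mean_reward = {mean:.2f}, spread = {spread:.2f}")
# prints: mean_reward = 9350.13, spread = 1.31
```

Keeping the two numbers separate makes it straightforward to sort or filter several model cards by reward without string comparisons.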