ppo-LunarLander-v2 / results.json
tripathysagar
Initial commit after training the stuff on PPO for 9e6 steps
1abb2a4
164 Bytes
{"mean_reward": 300.4308471157418, "std_reward": 10.612499711530152, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-12-29T12:45:40.208196"}