ppo-Lander_test / results.json
slarionne's picture
First attempt at training PPO Lunar Lander. Purposely making it basic
36065c2
raw
history blame
164 Bytes
{"mean_reward": -250.5470592558733, "std_reward": 99.16591098098986, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-08-13T18:32:37.493547"}