DeepRL-PPO-LLv2 / results.json
0x05a4's picture
Baseline: LR=3e-4/.99, epochs=2e6
f588a6c
raw
history blame
164 Bytes
{"mean_reward": 256.70422000804786, "std_reward": 15.77447780951087, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-14T18:23:38.330754"}