DeepRL-PPO-LLv2 / results.json
0x05a4's picture
Baseline 1M epochs
6e56ef6
raw
history blame
165 Bytes
{"mean_reward": 263.46341418885197, "std_reward": 22.886157125894723, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2022-05-14T16:54:38.754838"}