dtiapkin
PPO with default parameters
26814d9
{"mean_reward": 276.8343995059675, "std_reward": 15.552994276073212, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-02-21T20:53:55.235851"}