ppo-LunarLander-v2 / results.json
Mtc2's picture
First reinforcement learning model on Hugging face
49c68fc
raw
history blame
157 Bytes
{"mean_reward": 272.7818194, "std_reward": 21.65887332426571, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2023-06-11T19:35:37.567548"}