salohiddin94
/

a2c-PandaReachDense-v3

Reinforcement Learning

stable-baselines3

PandaReachDense-v3

deep-reinforcement-learning

Model card Files Files and versions Community

salohiddin94 commited on Oct 15, 2023

Commit

5010926

·

1 Parent(s): 658a2fa

Update README.md

Files changed (1) hide show

README.md +20 -2

README.md CHANGED Viewed

@@ -30,8 +30,26 @@ TODO: Add your code
 ```python
-from stable_baselines3 import ...
-from huggingface_sb3 import load_from_hub
 ...
 ```

 ```python
+from stable_baselines3.common.vec_env import DummyVecEnv, VecNormalize
+# Load the saved statistics
+eval_env = DummyVecEnv([lambda: gym.make("PandaReachDense-v3")])
+eval_env = VecNormalize.load("vec_normalize.pkl", eval_env)
+# We need to override the render_mode
+eval_env.render_mode = "rgb_array"
+#  do not update them at test time
+eval_env.training = False
+# reward normalization is not needed at test time
+eval_env.norm_reward = False
+# Load the agent
+model = A2C.load("a2c-PandaReachDense-v3")
+mean_reward, std_reward = evaluate_policy(model, eval_env)
+print(f"Mean reward = {mean_reward:.2f} +/- {std_reward:.2f}")
 ...
 ```