trained model for ALE/Galaxian-v5 using PPO

Files changed (11) hide show

PPO-Galaxian-v5.zip ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0dd8057a866d8cdebc6695bea5b61dbb697efb94cc9b57003dcc6c47486a5a51
+size 140080722

PPO-Galaxian-v5/_stable_baselines3_version ADDED Viewed

	@@ -0,0 +1 @@


1	+ 2.3.2

PPO-Galaxian-v5/data ADDED Viewed

The diff for this file is too large to render. See raw diff

PPO-Galaxian-v5/policy.optimizer.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:44aeee408780c61e5f48bb03adb9b1562cd47253769e0a62b0801d3940aa734e
+size 92924714

PPO-Galaxian-v5/policy.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:3b2180a598991b838dc100485ec6a459afce0ed656d3e71502e46788cb1ff9ba
+size 46464498

PPO-Galaxian-v5/pytorch_variables.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ebdad4b9cfe9cd22a3abadb5623bf7bb1f6eb2e408740245eb3f2044b0adc018
+size 864

PPO-Galaxian-v5/system_info.txt ADDED Viewed

+- OS: macOS-14.5-arm64-arm-64bit Darwin Kernel Version 23.5.0: Wed May  1 20:19:05 PDT 2024; root:xnu-10063.121.3~5/RELEASE_ARM64_T8112
+- Python: 3.9.6
+- Stable-Baselines3: 2.3.2
+- PyTorch: 2.3.0
+- GPU Enabled: False
+- Numpy: 1.26.4
+- Cloudpickle: 3.0.0
+- Gymnasium: 0.29.1

README.md ADDED Viewed

+---
+library_name: stable-baselines3
+tags:
+- ALE/Galaxian-v5
+- deep-reinforcement-learning
+- reinforcement-learning
+- stable-baselines3
+model-index:
+- name: PPO
+  results:
+  - task:
+      type: reinforcement-learning
+      name: reinforcement-learning
+    dataset:
+      name: ALE/Galaxian-v5
+      type: ALE/Galaxian-v5
+    metrics:
+    - type: mean_reward
+      value: 940.00 +/- 0.00
+      name: mean_reward
+      verified: false
+---
+# **PPO** Agent playing **ALE/Galaxian-v5**
+This is a trained model of a **PPO** agent playing **ALE/Galaxian-v5**
+using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+## Usage (with Stable-baselines3)
+TODO: Add your code
+```python
+from stable_baselines3 import ...
+from huggingface_sb3 import load_from_hub
+...
+```

config.json ADDED Viewed

The diff for this file is too large to render. See raw diff

replay.mp4 ADDED Viewed

Binary file (266 kB). View file

results.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"mean_reward": 940.0, "std_reward": 0.0, "is_deterministic": false, "n_eval_episodes": 10, "eval_datetime": "2024-05-21T12:57:31.252524"}