My trained model

Files changed (10) hide show

PPO_model_v1.zip ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:c9ef3df4a3722a1888b529f7bf7fe1e3e6a39ea38a83914b6df6bdaa7e9a813b
+size 29447806

PPO_model_v1/_stable_baselines3_version ADDED Viewed

	@@ -0,0 +1 @@


1	+ 2.3.2

PPO_model_v1/data ADDED Viewed

The diff for this file is too large to render. See raw diff

PPO_model_v1/policy.optimizer.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0c00f66e642f5c8dbe88596a829504d05729dd7ecdaa5237d82dfcd6d878716b
+size 19518205

PPO_model_v1/policy.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ce988d2d967e0cebb8f2b7728388214a34630d8faa0198abdb2a04fd725b6248
+size 9761059

PPO_model_v1/pytorch_variables.pth ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ebdad4b9cfe9cd22a3abadb5623bf7bb1f6eb2e408740245eb3f2044b0adc018
+size 864

PPO_model_v1/system_info.txt ADDED Viewed

+- OS: macOS-14.6.1-arm64-i386-64bit Darwin Kernel Version 23.6.0: Mon Jul 29 21:14:30 PDT 2024; root:xnu-10063.141.2~1/RELEASE_ARM64_T6000
+- Python: 3.11.9
+- Stable-Baselines3: 2.3.2
+- PyTorch: 2.3.1
+- GPU Enabled: False
+- Numpy: 1.26.4
+- Cloudpickle: 3.0.0
+- Gymnasium: 0.29.1
+- OpenAI Gym: 0.17.2

README.md ADDED Viewed

+---
+library_name: stable-baselines3
+tags:
+- CarRacing-v2
+- deep-reinforcement-learning
+- reinforcement-learning
+- stable-baselines3
+model-index:
+- name: PPO
+  results:
+  - task:
+      type: reinforcement-learning
+      name: reinforcement-learning
+    dataset:
+      name: CarRacing-v2
+      type: CarRacing-v2
+    metrics:
+    - type: mean_reward
+      value: 850.01 +/- 137.65
+      name: mean_reward
+      verified: false
+---
+# **PPO** Agent playing **CarRacing-v2**
+This is a trained model of a **PPO** agent playing **CarRacing-v2**
+using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
+## Usage (with Stable-baselines3)
+TODO: Add your code
+```python
+from stable_baselines3 import ...
+from huggingface_sb3 import load_from_hub
+...
+```

config.json ADDED Viewed

The diff for this file is too large to render. See raw diff

results.json ADDED Viewed

	@@ -0,0 +1 @@


1	+ {"mean_reward": 850.0146951, "std_reward": 137.65236984414358, "is_deterministic": true, "n_eval_episodes": 10, "eval_datetime": "2024-09-19T13:37:19.028795"}