kimnguyenwork commited on
Commit
8fba2b7
·
1 Parent(s): e1c6299

Push agent to the Hub

Browse files
README.md CHANGED
@@ -1,12 +1,13 @@
1
  ---
2
  tags:
3
  - CartPole-v1
4
- - reinforce
 
5
  - reinforcement-learning
6
  - custom-implementation
7
- - deep-rl-class
8
  model-index:
9
- - name: cartpole-v1
10
  results:
11
  - task:
12
  type: reinforcement-learning
@@ -16,12 +17,45 @@ model-index:
16
  type: CartPole-v1
17
  metrics:
18
  - type: mean_reward
19
- value: 500.00 +/- 0.00
20
  name: mean_reward
21
  verified: false
22
  ---
23
 
24
- # **Reinforce** Agent playing **CartPole-v1**
25
- This is a trained model of a **Reinforce** agent playing **CartPole-v1** .
26
- To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
27
 
 
1
  ---
2
  tags:
3
  - CartPole-v1
4
+ - ppo
5
+ - deep-reinforcement-learning
6
  - reinforcement-learning
7
  - custom-implementation
8
+ - deep-rl-course
9
  model-index:
10
+ - name: PPO
11
  results:
12
  - task:
13
  type: reinforcement-learning
 
17
  type: CartPole-v1
18
  metrics:
19
  - type: mean_reward
20
+ value: 203.30 +/- 123.31
21
  name: mean_reward
22
  verified: false
23
  ---
24
 
25
+ # PPO Agent Playing CartPole-v1
26
+
27
+ This is a trained model of a PPO agent playing CartPole-v1.
28
+
29
+ # Hyperparameters
30
+ ```python
31
+ {'exp_name': 'ppo'
32
+ 'seed': 1
33
+ 'torch_deterministic': True
34
+ 'cuda': True
35
+ 'track': False
36
+ 'wandb_project_name': 'cleanRL'
37
+ 'wandb_entity': None
38
+ 'capture_video': False
39
+ 'env_id': 'CartPole-v1'
40
+ 'total_timesteps': 50000
41
+ 'learning_rate': 0.00025
42
+ 'num_envs': 4
43
+ 'num_steps': 128
44
+ 'anneal_lr': True
45
+ 'gae': True
46
+ 'gamma': 0.99
47
+ 'gae_lambda': 0.95
48
+ 'num_minibatches': 4
49
+ 'update_epochs': 4
50
+ 'norm_adv': True
51
+ 'clip_coef': 0.2
52
+ 'clip_vloss': True
53
+ 'ent_coef': 0.01
54
+ 'vf_coef': 0.5
55
+ 'max_grad_norm': 0.5
56
+ 'target_kl': None
57
+ 'repo_id': 'kimnguyenwork/cartpole-v1'
58
+ 'batch_size': 512
59
+ 'minibatch_size': 128}
60
+ ```
61
 
logs/events.out.tfevents.1694025227.535916435daa.5815.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9ea9fa69c307feaaac6484ce7331b581cc5f027fd6f623b5292d222d7447f6cc
3
+ size 109184
model.pt CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:34402f9e5e079c02306fe667e3d00144dbf4fd284c1b3abb804f9f384a775307
3
- size 2771
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5705843cd922e3a7e46bddd7ddcec1dba41eaa63d0fb0d22bc85ae5ff88e5df4
3
+ size 40037
replay.mp4 CHANGED
Binary files a/replay.mp4 and b/replay.mp4 differ
 
results.json CHANGED
@@ -1 +1 @@
1
- {"env_id": "CartPole-v1", "mean_reward": 500.0, "n_evaluation_episodes": 10, "eval_datetime": "2023-09-06T18:16:25.444884"}
 
1
+ {"env_id": "CartPole-v1", "mean_reward": 203.3, "std_reward": 123.30616367400295, "n_evaluation_episodes": 10, "eval_datetime": "2023-09-06T18:34:21.547215"}