Jarlaxle
commited on
Commit
·
766f9ab
1
Parent(s):
1d6b651
Update README.md
Browse files
README.md
CHANGED
@@ -26,4 +26,6 @@ model-index:
|
|
26 |
To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
|
27 |
|
28 |
Some notes: I used gymnasium and not gym, so I had to modify some instructions to make the page work. I trained the agent for 2000 episodes and I achieved a model that never makes the pole fall!!!
|
|
|
|
|
29 |
|
|
|
26 |
To learn to use this model and train yours check Unit 4 of the Deep Reinforcement Learning Course: https://huggingface.co/deep-rl-course/unit4/introduction
|
27 |
|
28 |
Some notes: I used gymnasium and not gym, so I had to modify some instructions to make the page work. I trained the agent for 2000 episodes and I achieved a model that never makes the pole fall!!!
|
29 |
+
|
30 |
+
The final score is 100000 only because I let the scoring function evaluate a maximum of 100000 steps ... but the score could be +infinity if I let the game going on forever!!!
|
31 |
|