Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
0x05a4
/
DeepRL-PPO-LLv2
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Community
Use this model
ab2dd36
DeepRL-PPO-LLv2
1 contributor
History:
6 commits
0x05a4
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
ab2dd36
over 1 year ago
LunarLander-v2-PPO-305
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
over 1 year ago
LunarLander-v2-PPO
Baseline: LR=3e-4/.996, epochs=2e6
over 2 years ago
.gitattributes
Safe
1.22 kB
Baseline 1M epochs
over 2 years ago
LunarLander-v2-PPO-305.zip
Safe
147 kB
LFS
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
over 1 year ago
LunarLander-v2-PPO.zip
Safe
146 kB
LFS
Baseline: LR=3e-4/.996, epochs=2e6
over 2 years ago
README.md
Safe
784 Bytes
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
over 1 year ago
config.json
Safe
14.4 kB
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
over 1 year ago
replay.mp4
Safe
158 kB
LFS
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
over 1 year ago
results.json
Safe
157 Bytes
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
over 1 year ago