Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
0x05a4
/
DeepRL-PPO-LLv2
like
0
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
Model card
Files
Files and versions
Community
Use this model
ab2dd36
DeepRL-PPO-LLv2
/
LunarLander-v2-PPO-305
/
_stable_baselines3_version
0x05a4
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
ab2dd36
over 1 year ago
raw
Copy download link
history
blame
Safe
7 Bytes
2
.
0
.
0
a5