0x05a4/DeepRL-PPO-LLv2
Tags: Reinforcement Learning, stable-baselines3, LunarLander-v2, deep-reinforcement-learning, Eval Results
Branch: main
Path: DeepRL-PPO-LLv2/LunarLander-v2-PPO
1 contributor
History: 4 commits
Latest commit: 678a575 by 0x05a4, "Baseline: LR=3e-4/.996, epochs=2e6", over 2 years ago
File                        Size       LFS  Last commit message                 Updated
_stable_baselines3_version  5 Bytes    no   Baseline 1M epochs                  over 2 years ago
data                        16.3 kB    no   Baseline: LR=3e-4/.996, epochs=2e6  over 2 years ago
policy.optimizer.pth        84.9 kB    yes  Baseline: LR=3e-4/.996, epochs=2e6  over 2 years ago
policy.pth                  43.2 kB    yes  Baseline: LR=3e-4/.996, epochs=2e6  over 2 years ago
pytorch_variables.pth       431 Bytes  yes  Baseline 1M epochs                  over 2 years ago
system_info.txt             193 Bytes  no   Baseline 1M epochs                  over 2 years ago

All six files are marked Safe by the Hub's security scan. pytorch_variables.pth is a pickle file; the scan reports no problematic imports detected.