asuzuki/PPO-LunarLander-v2
Tags: Reinforcement Learning, Transformers, TensorBoard, LunarLander-v2, ppo, deep-reinforcement-learning, custom-implementation, deep-rl-course, Eval Results, Inference Endpoints
Branch: main
1 contributor
History: 7 commits
Latest commit: asuzuki, "Push agent to the Hub" (30b1eab), over 1 year ago
File                     Scan   Size        LFS   Last commit message                        Updated
logs                     -      -           -     Push agent to the Hub                      over 1 year ago
ppo-LunarLander-v2       -      -           -     first commit - model PPO performing good   almost 2 years ago
.gitattributes           Safe   1.48 kB     -     initial commit                             almost 2 years ago
README.md                Safe   1.16 kB     -     Push agent to the Hub                      over 1 year ago
config.json              Safe   14.4 kB     -     first commit - model PPO performing good   almost 2 years ago
lunar_lander_v2.ipynb    Safe   355 kB      -     updated code                               almost 2 years ago
model.pt                 Safe   42.6 kB     LFS   Push agent to the Hub                      over 1 year ago
ppo-LunarLander-v2.zip   Safe   147 kB      LFS   first commit - model PPO performing good   almost 2 years ago
replay.mp4               Safe   27.3 kB     -     Push agent to the Hub                      over 1 year ago
results.json             Safe   174 Bytes   -     Push agent to the Hub                      over 1 year ago

model.pt (pickle): Detected Pickle imports (3): "collections.OrderedDict", "torch.FloatStorage", "torch._utils._rebuild_tensor_v2"