Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Hans14
/
LunarLander-v2
like
0
Reinforcement Learning
Transformers
TensorBoard
LunarLander-v2
ppo
deep-reinforcement-learning
custom-implementation
deep-rl-course
Eval Results
Inference Endpoints
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
fe43219
LunarLander-v2
1 contributor
History:
8 commits
Hans14
Push agent to the Hub
fe43219
over 1 year ago
logs
Push agent to the Hub
over 1 year ago
pop-lunar-lander-test-2
UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321
over 1 year ago
.gitattributes
1.48 kB
initial commit
over 1 year ago
README.md
1.15 kB
Push agent to the Hub
over 1 year ago
config.json
12.8 kB
UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321
over 1 year ago
model.pt
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
42.6 kB
LFS
Push agent to the Hub
over 1 year ago
pop-lunar-lander-test-2.zip
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
146 kB
LFS
UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321
over 1 year ago
replay.mp4
33.7 kB
Push agent to the Hub
over 1 year ago
results.json
173 Bytes
Push agent to the Hub
over 1 year ago