Hans14
/

LunarLander-v2

Reinforcement Learning

deep-reinforcement-learning

custom-implementation

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

1 contributor

History: 8 commits

Hans14's picture

Push agent to the Hub

fe43219 over 1 year ago

logs
Push agent to the Hub over 1 year ago
pop-lunar-lander-test-2
UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321 over 1 year ago
.gitattributes

1.48 kB

initial commit over 1 year ago
README.md

1.15 kB

Push agent to the Hub over 1 year ago
config.json

12.8 kB

UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321 over 1 year ago
model.pt
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
What is a pickle import?
42.6 kB
LFS

Push agent to the Hub over 1 year ago
pop-lunar-lander-test-2.zip
Pickle imports
- No problematic imports detected
What is a pickle import?
146 kB
LFS

UPLOAD Model version 1 : no hyperparameter trained on 1M step PPO architecture. Mean_reward 263.07035025211746 +/- std_reward 15.52574254837321 over 1 year ago
replay.mp4

33.7 kB

Push agent to the Hub over 1 year ago
results.json

173 Bytes

Push agent to the Hub over 1 year ago