0x05a4
/
DeepRL-PPO-LLv2
Reinforcement Learning
stable-baselines3
LunarLander-v2
deep-reinforcement-learning
Eval Results
main
DeepRL-PPO-LLv2
Commit History
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
ab2dd36
0x05a4
committed on
Jun 16, 2023
Baseline: LR=3e-4/.996, epochs=2e6
678a575
0x05a4
committed on
May 14, 2022
Baseline: LR=3e-4/.99, epochs=2e6
f588a6c
0x05a4
committed on
May 14, 2022
Baseline: LR=1e-4, epochs=1e6
1d38dd0
0x05a4
committed on
May 14, 2022
Baseline 1M epochs
6e56ef6
0x05a4
committed on
May 14, 2022
initial commit
df263ec
0x05a4
committed on
May 14, 2022