DeepRL-PPO-LLv2 / LunarLander-v2-PPO-305 /_stable_baselines3_version
0x05a4's picture
Baseline: LR=5e-4/cosine-100, epochs=1e7/305
ab2dd36
raw
history blame
7 Bytes
2.0.0a5