DeepRL-PPO-LLv2 / LunarLander-v2-PPO-305 /_stable_baselines3_version

Commit History

Baseline: LR=5e-4/cosine-100, epochs=1e7/305
ab2dd36

0x05a4 commited on