ppo-LunarLander-v2-optuna-pars
/
ppo-jarlaxle-LunarLander-v2-optuna-pars
/_stable_baselines3_version
Jarlaxle
My best trained agent for LunarLander-v2 task. I selected learning rate, gamma, gae_lambda and entropy through optuna.
e04d742
2.2.1 |