0x05a4
/

DeepRL-PPO-LLv2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Model card Files Files and versions Community

DeepRL-PPO-LLv2

1 contributor

History: 4 commits

0x05a4's picture

Baseline: LR=3e-4/.99, epochs=2e6

f588a6c over 2 years ago