0x05a4/DeepRL-PPO-LLv2

Reinforcement Learning

stable-baselines3

deep-reinforcement-learning

Commit History

Baseline: LR=5e-4/cosine-100, epochs=1e7/305

ab2dd36

0x05a4 committed on Jun 16, 2023

Baseline: LR=3e-4/.996, epochs=2e6

678a575

0x05a4 committed on May 14, 2022

Baseline: LR=3e-4/.99, epochs=2e6

f588a6c

0x05a4 committed on May 14, 2022

Baseline: LR=1e-4, epochs=1e6

1d38dd0

0x05a4 committed on May 14, 2022

Baseline 1M epochs

6e56ef6

0x05a4 committed on May 14, 2022

initial commit

df263ec

0x05a4 committed on May 14, 2022