c4c7cee
1
2
3
4
5
6
7
optim: _target_: torch.optim.Adam lr: 1e-3 betas: [0.9, 0.999] weight_decay: 0.01 exclude_ln_and_biases_from_weight_decay: False