swap batch size for gradient accumulation steps to decouple from num gpu c2a0792 winglian commited on May 31, 2023
Update wandb_log_model on llama_13B_alpaca.yml 0736f4f unverified Viktorius Suwandi commited on May 29, 2023
tweaks to data loading, 8 bit adam, accelerate and deepspeed 097d367 winglian commited on Apr 22, 2023