use save_strategy from config if available (#434) b3f5e00 unverified winglian committed on Aug 19, 2023
flash attn pip install (#426) cf66547 unverified mhenrichsen Ubuntu mhenrichsen Mads Henrichsen winglian committed on Aug 18, 2023
fix fixture for new tokenizer handling in transformers (#428) 8cace80 unverified winglian committed on Aug 17, 2023
Fix(docs): Remove gptq+lora and fix xformer compat list (#423) 3d1f203 unverified Nanobit committed on Aug 16, 2023
Fix(template): Inform to place stack trace to Issue (#417) b7449a9 unverified Nanobit winglian committed on Aug 16, 2023
use inputs for image rather than outputs for docker metadata (#420) 5f80b35 unverified winglian committed on Aug 15, 2023
tag with latest as well for axolotl-runpod (#418) 7af8166 unverified winglian committed on Aug 15, 2023
Merge pull request #413 from mhenrichsen/chore/update-deepseed-config f806e86 unverified mhenrichsen committed on Aug 15, 2023
Feat(doc): Add lr_quadratic_warmup to readme (#412) 2b990eb unverified Nanobit committed on Aug 15, 2023
Fix(config): Update handling of deepspeed config (#404) c01015f unverified Nanobit committed on Aug 15, 2023
Fix(template): Remove iPhone/android from Issue template (#407) 7ad37cb unverified Nanobit committed on Aug 15, 2023
add templates, CoC and contributing guide (#126) 31db0ec unverified lightningRalf winglian Nanobit committed on Aug 15, 2023
better handling of empty input ids when tokenizing (#395) 85cf4f8 unverified winglian committed on Aug 15, 2023
use context manager to run things on rank0 before others (#397) fc2d6be unverified winglian committed on Aug 15, 2023
Error msg for sharegpt if conv has less than 2 msg (#379) 63fdb5a unverified flotos committed on Aug 14, 2023
new llama-2 default settings (#370) fdffef5 unverified mhenrichsen Mads Henrichsen committed on Aug 14, 2023
don't pass rope_scaling kwarg if it's None (#383) 919246f unverified winglian committed on Aug 13, 2023
bump flash-attn to 2.0.4 for the base docker image (#382) ffac902 unverified winglian committed on Aug 13, 2023
try to detect accelerate and only use device_map=None in that case (#373) 094fc2c unverified tmm1 committed on Aug 13, 2023
revert previous change and build ax images w docker on gpu (#371) 918f1b0 unverified winglian committed on Aug 13, 2023