Fix(config): Update handling of deepspeed config (#404) c01015f unverified Nanobit commited on Aug 15, 2023
Fix(template): Remove iPhone/android from Issue template (#407) 7ad37cb unverified Nanobit commited on Aug 15, 2023
add templates, CoC and contributing guide (#126) 31db0ec unverified lightningRalf winglian Nanobit commited on Aug 15, 2023
better handling of empty input ids when tokenizing (#395) 85cf4f8 unverified winglian commited on Aug 15, 2023
use context manager to run things on rank0 before others (#397) fc2d6be unverified winglian commited on Aug 15, 2023
Error msg for sharegpt if conv has less than 2 msg (#379) 63fdb5a unverified flotos commited on Aug 14, 2023
new llama-2 default settings (#370) fdffef5 unverified mhenrichsen Mads Henrichsen commited on Aug 14, 2023
don't pass rope_scaling kwarg if it's None (#383) 919246f unverified winglian commited on Aug 13, 2023
bump flash-attn to 2.0.4 for the base docker image (#382) ffac902 unverified winglian commited on Aug 13, 2023
try to detect accelerate and only use device_map=None in that case (#373) 094fc2c unverified tmm1 commited on Aug 13, 2023
revert previous change and build ax images w docker on gpu (#371) 918f1b0 unverified winglian commited on Aug 13, 2023
attempt to run non-base docker builds on regular cpu hosts (#369) c3fde36 unverified winglian commited on Aug 12, 2023
Attention mask and position id fixes for packing (#285) 2bb0b78 unverified winglian commited on Aug 12, 2023
Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 unverified Morgan McGuire Morgan McGuire winglian commited on Aug 12, 2023
Fix(model loading): Warn when model revision is passed to gptq (#364) 96bd6ae unverified Nanobit commited on Aug 12, 2023
Fix(message): Improve error message for bad format (#365) e37d935 unverified Nanobit commited on Aug 12, 2023
Merge pull request #355 from tmm1/bitsandbytes-fixes 35c8b90 unverified tmm1 commited on Aug 11, 2023
Merge pull request #350 from tmm1/group-len-false-examples f5c11f8 unverified tmm1 commited on Aug 9, 2023