don't compile deepspeed or bitsandbytes from source (#837) f544ab2 unverified winglian commited on Nov 9, 2023
chore: bump transformers to v4.34.1 to fix tokenizer issue (#745) 8966a6f unverified Nanobit commited on Oct 20, 2023
Fix(version): Update FA to work with Mistral SWA (#673) 43856c0 unverified Nanobit commited on Oct 4, 2023
Feat: Allow usage of native Mistral FA when no sample_packing (#669) 697c50d unverified Nanobit commited on Oct 4, 2023
update readme to point to direct link to runpod template, cleanup install instrucitons (#532) 34c0a86 unverified winglian commited on Sep 8, 2023
Add support for GPTQ using native transformers/peft (#468) 3355706 unverified winglian commited on Sep 5, 2023
flash attn pip install (#426) cf66547 unverified mhenrichsen Ubuntu mhenrichsen Mads Henrichsen winglian commited on Aug 18, 2023
Attention mask and position id fixes for packing (#285) 2bb0b78 unverified winglian commited on Aug 12, 2023
Merge pull request #355 from tmm1/bitsandbytes-fixes 35c8b90 unverified tmm1 commited on Aug 11, 2023
latest HEAD of accelerate causes 0 loss immediately w FSDP (#321) 9f69c4d unverified winglian commited on Jul 24, 2023
update docker to compile latest bnb to properly support qlora 312b8d5 winglian commited on May 27, 2023
quickstart instructions for starting from runpod (#5) 0a472e1 unverified winglian commited on Apr 18, 2023