drop length column for issues with eval without packing (#1711) 3f1f5e3 unverified winglian commited on Jun 19
bump deepspeed for fix for grad norm compute putting tensors on different devices (#1699) 851ccb1 unverified winglian commited on Jun 9
fix for when sample_packing and eval_sample_packing are different (#1695) 18cabc0 unverified winglian commited on Jun 8
add back packing efficiency estimate so epochs and multi-gpu works properly (#1697) ed8ef65 unverified winglian commited on Jun 8
ensure explicit eval_sample_packing to avoid mismatch issues (#1692) 9c1af1a unverified winglian commited on Jun 7
Phi-3 conversation format, example training script and perplexity metric (#1582) cf64284 unverified roborovski winglian commited on Jun 4
cleanup the deepspeed proxy model at the end of training (#1675) d4f6c65 unverified winglian commited on May 30
set chat_template in datasets config automatically (#1664) 9d4225a unverified winglian commited on May 30
use mixins for orpo and kto configs so they work with axolotl customizations (#1674) f7332ac unverified winglian commited on May 30
make sure the CI fails when pytest script fails (#1669) fe650dd unverified winglian commited on May 29
Fix README quick start example usage model dirs (#1668) 49b967b unverified Abe Voelker commited on May 28
Correct name of MixtralBlockSparseTop2MLP (L -> l) (#1667) 65db903 unverified seungduk commited on May 28
Fix: ensure correct handling of `val_set_size` as `float` or `int` (#1655) 6a5a725 unverified Davide Caroselli winglian commited on May 28
Generalizing the chat_template prompt strategy (#1660) [skip ci] cc11c6b unverified fozziethebeat commited on May 28
document how to use `share_strategy="no"` (#1653) [skip ci] 8a20a7b unverified charlesfrye commited on May 24
Switch to parallel FFD bin packing algorithm. (#1619) 367b2e8 unverified winglian daaave commited on May 23
Update tiny-llama qlora.yml addressing eval packing error (#1638) 84bb806 unverified Jaydeep Thik commited on May 22
enable loraplus setting for dpo trainer (#1646) a27d5e1 unverified thepowerfuldeez commited on May 22
Fix llama3 chat_template (extra <|eot_id|> on last turn) (#1635) 7c2bf30 unverified leonardlin winglian commited on May 21