Commit History
DBRX Model Support (#1462)
132eb74
unverified
use locale agnostic separator to make large nums easier to read (#1503)
da9b1a3
unverified
WIP: Support table logging for mlflow, too (#1506)
057fa44
unverified
Correctly handle splits for datasets.arrow_dataset.Dataset objects (#1504)
8fa0785
unverified
Print versions (#1496)
4313b1a
unverified
add field to sft dataset pydantic for completion support (#1497)
ff01c45
unverified
ignore issues with calculating # params when printing (#1493)
2fa65b9
unverified
Remove `validate_quantized_dora` (#1485)
9430b6e
unverified
xzuyn
committed on
drop empty token from beginning if tokenizer has no bos_token (in the case of qwen) (#1490)
934fc85
unverified
fix: reduce sample_packing warning (#1484)
bda48f0
unverified
feat: validate sample packing requires flash_attention (#1465)
bf4cd67
unverified
add support for cohere chat template (#1478)
05b0b7e
unverified
don't use deepspeed or fsdp when merging loras (#1479)
87ca3f9
unverified
refactor utils.data module for line count linter (#1476)
e0fcef4
unverified
Pretrain multipack v2 (#1470)
5aa5097
unverified
fix pretraining_ on odd datasets (#1463)
586bd8d
unverified
reduce verbosity of the special tokens (#1472)
0b10377
unverified
qwen2_moe support with multipack (#1455)
6086be8
unverified
fix some of the edge cases for Jamba (#1452)
05b398a
unverified
Support loading datasets saved via save_to_disk (#1432)
e634118
unverified
Keith Stevens
committed on