make sure to register the base chatml template even if no system message is provided (#1207) badda37 unverified winglian commited on Jan 25
Feat/chatml add system message (#1117) 98b4762 unverified mhenrichsen Mads Henrichsen winglian commited on Jan 25
more dpo fixes for dataset loading and docs (#1185) [skip ci] 5bce45f unverified winglian commited on Jan 24
Fix generation_config validation raises Exception for do_merge_lora (#1184) 02f2c72 unverified tisorlawan commited on Jan 24
Add support for offline mode with HF_HUB_OFFLINE envvar (#1182) 71141de unverified James Wade winglian commited on Jan 24
don't fail if can't cast weights due to offload when merging (#1172) [skip ci] fb7f9b9 unverified winglian commited on Jan 23
Disable caching on `--disable_caching` in CLI (#1110) d66b101 unverified casperhansen winglian commited on Jan 13
update sharegpt conversations when chatml chat template is set (#1075) [skip ci] 0ce1a65 unverified winglian commited on Jan 10
Add: mlflow for experiment tracking (#1059) [skip ci] 090c24d unverified Johan Hansson winglian commited on Jan 9
feature: better device mapping for large models (#918) bdfefaf unverified dg-kalle Karl-Johan Alm winglian commited on Jan 5
Fix: bf16 support for inference (#981) 3678a6c unverified Tazik Shahjahan winglian commited on Dec 29, 2023
feat: remove need to add load_in* during merge (#1017) f6ecf14 unverified Nanobit commited on Dec 29, 2023
remove landmark attn and xpos rope implementations (#1010) 70b46ca unverified winglian commited on Dec 28, 2023
ensure merged model matches the training dtype (#902) 1d21aa6 unverified winglian commited on Nov 29, 2023
Determine FSDP/deepspeed settings on device select. (#883) 71b7ea3 unverified user735 Karl-Johan Alm winglian commited on Nov 29, 2023
include the suffix modified string in ascii art (#852) 614cff4 unverified fpreiss commited on Nov 15, 2023
improve handling of the prepared ds path and other cfg defaults (#701) 1c412c7 unverified winglian commited on Oct 13, 2023
Save Axolotl config as WandB artifact (#716) 490923f unverified Jan Philipp Harries commited on Oct 11, 2023
prepared dataset caching, other misc fixes (#665) e50a64e unverified winglian commited on Oct 3, 2023
prevent cli functions from getting fired on import (#581) 8dcd40a unverified winglian commited on Sep 15, 2023
refactor scripts/finetune.py into new cli modules (#550) 861ceca unverified winglian Nanobit commited on Sep 15, 2023