Save Axolotl config as WandB artifact (#716) 490923f unverified Jan Philipp Harries commited on Oct 11, 2023
refactor to set eval_batch_size earlier if unset, so we can warn if mismatched (#662) 2642cae unverified winglian commited on Oct 3, 2023
Fix(cfg): Add validation for save_strategy and eval_strategy (#633) 383f88d unverified Nanobit commited on Sep 28, 2023
chore(callback): Remove old peft saving code (#510) d5f8589 unverified Nanobit commited on Sep 22, 2023
run eval on the first step to get a baseline (#617) 2844eb2 unverified winglian commited on Sep 22, 2023
gather/broadcast the max value of the packing efficiency automatically (#463) b15b19e unverified winglian commited on Sep 17, 2023
optionally configure sample packing for evals (#589) 21ec195 unverified winglian commited on Sep 16, 2023
fix save_steps so it doesn't get duplicated (#567) 3fbde76 unverified winglian commited on Sep 14, 2023
improve how we setup eval/save strategies and steps (#547) 36e53c7 unverified winglian commited on Sep 13, 2023
Add training callback to send predictions to WandB table (#521) 5b67ea9 unverified Glavin001 commited on Sep 13, 2023
Add support for GPTQ using native transformers/peft (#468) 3355706 unverified winglian commited on Sep 5, 2023
Added advanced DDP args (#515) 396a7a7 unverified Jan Philipp Harries Jan Philipp Harries commited on Aug 31, 2023
pad_to_worst_case_seq_len boolean, for testing memory limits (#498) 8e197f6 unverified Birch-san tmm1 commited on Aug 28, 2023
ReLoRA implementation (with quantization) (#322) bde3c5a unverified chargoddard winglian commited on Aug 24, 2023
use save_strategy from config if available (#434) b3f5e00 unverified winglian commited on Aug 19, 2023
Fix(config): Update handling of deepspeed config (#404) c01015f unverified Nanobit commited on Aug 15, 2023
Attention mask and position id fixes for packing (#285) 2bb0b78 unverified winglian commited on Aug 12, 2023
Merge branch 'OpenAccess-AI-Collective:main' into logging_enhancement 83237b8 unverified The Objective Dad commited on Jul 15, 2023
Merge pull request #274 from OpenAccess-AI-Collective/NanoCode012-patch-2 168a7a0 unverified Nanobit commited on Jul 14, 2023