Commit History
make sure to use train split if loading from hf
607a4d3
winglian
commited on
make one cycle lr div factor configurable
99383f1
winglian
commited on
fix new dataset prompt tokenizers
0f74464
winglian
commited on
pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too
2809f3f
winglian
commited on
tokenization fixes
4ea9a66
winglian
commited on
optionally be able to specify alpaca or chat style prompts
1d5ab84
winglian
commited on
Set `half` using `cfg.fp16` for 4bit
641f801
unverified
Nanobit
commited on
concise multiple choice and tldr summarize
1365073
winglian
commited on
support for replit lm
8c2f3cb
winglian
commited on
add alpaca multiple choice instruct dataset support
b46bc02
winglian
commited on
Add `lora_modules_to_save`
2c73c81
unverified
Nanobit
commited on
more fixes
bdbca8f
winglian
commited on
more fixes
42410c7
winglian
commited on
fix torch_dtype for model load
aef00b6
winglian
commited on
move filter to before saving so it doesn't happen everytime, update runpod manual script
0d28df0
winglian
commited on
whoops, gt vs lt
84c7bc4
winglian
commited on
optimize dataloading to use cache, fix model token embedding sizes
aa3c3f9
winglian
commited on
Merge branch 'main' into patch-2
89b7f26
unverified
Nanobit
commited on
black formatting
2bc1a5b
winglian
commited on
various fixes
7a490a4
winglian
commited on
Fix Trainer() got multiple values for keyword argument 'callbacks'
813aab3
unverified
Nanobit
commited on
testing mpt triton
e2e68c3
winglian
commited on
fix conditional so alpaca doesn't choke
a27d594
winglian
commited on
Add CompletionPrompt type
cf68153
Nanobit
commited on
Merge pull request #21 from NanoCode012/patch-1
bd3c5a5
unverified
winglian
commited on
Merge pull request #19 from NanoCode012/feat/callback-save-lora
bcbc99e
unverified
winglian
commited on
Update trainer.py
36aaea0
unverified
Nanobit
commited on
Fix condition scheduler
5b6690a
unverified
Nanobit
commited on
add support for trust_remote_code for mpt models
a125693
winglian
commited on
Add callbacks to Trainer
cc77bab
Nanobit
commited on
Add callback save peft_model on_save
0d6708b
Nanobit
commited on
Jeopardy bot! (#17)
a12fb0a
unverified
winglian
commited on
fix #16 load best model setting when using 8bit
a4329b1
winglian
commited on
use micro batch size for eval size if not specified
550502b
winglian
commited on
refactor inference, warn if model is frozen
247825b
winglian
commited on
Merge pull request #13 from winglian/dev
cb9a887
unverified
winglian
commited on
Add eval_batch_size for evaluation
0e74b64
Nanobit
commited on
fix log sweep lr
a10a826
winglian
commited on
support for multi line inference input, log sweep over learning rates
9105935
winglian
commited on
fix adam bnb optimizer grouped parameters, fix peft model 8bit conversion logic, black formatting
7748f3d
winglian
commited on
support llama-adapter zero init attention
2255bb7
winglian
commited on
fdsp config dict fix, todo list, add torchdistx support
ad2b48c
winglian
commited on
8bit and deepspeed changes
9190ada
winglian
commited on
don't load models in 8bit unless they are using an adapter, also fix tokenizer load in exceptional case
6dfdd2d
winglian
commited on
fix fsdp training args
29936bb
winglian
commited on
fix for zero value warmup steps
7882181
winglian
commited on
fix sharegpt tokenization, refactor tokenization debugging
5159d00
winglian
commited on
wire up gradient checkpointing for 4bit
c0f50d9
winglian
commited on
fix dataset handling, support galactica
4a17a4c
winglian
commited on