Commit History

fix new dataset prompt tokenizers
0f74464

winglian commited on

add missing __init__
e0602a9

winglian commited on

pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too
2809f3f

winglian commited on

tokenization fixes
4ea9a66

winglian commited on

Merge pull request #32 from NanoCode012/patch-2
ed37b22
unverified

winglian commited on

optionally be able to specify alpaca or chat style prompts
1d5ab84

winglian commited on

Set `half` using `cfg.fp16` for 4bit
641f801
unverified

Nanobit commited on

update entrypoint and force min accelerate
fa8bd14

winglian commited on

concise multiple choice and tldr summarize
1365073

winglian commited on

support for replit lm
8c2f3cb

winglian commited on

add alpaca multiple choice instruct dataset support
b46bc02

winglian commited on

Merge pull request #29 from NanoCode012/patch-1
e553c90
unverified

winglian commited on

Add `lora_modules_to_save`
2c73c81
unverified

Nanobit commited on

reorder options so debug can happen in the same prepare step
f98e173

winglian commited on

fix prompters, especially the sharegpt prompter
5e37144

winglian commited on

more fixes
bdbca8f

winglian commited on

more fixes
42410c7

winglian commited on

fix torch_dtype for model load
aef00b6

winglian commited on

move filter to before saving so it doesn't happen everytime, update runpod manual script
0d28df0

winglian commited on

whoops, gt vs lt
84c7bc4

winglian commited on

optimize dataloading to use cache, fix model token embedding sizes
aa3c3f9

winglian commited on

Merge pull request #25 from NanoCode012/patch-2
f6d1fa4
unverified

winglian commited on

Merge branch 'main' into patch-2
89b7f26
unverified

Nanobit commited on

fix config for parity with previous change
165da58

winglian commited on

Merge pull request #27 from NanoCode012/patch-1
4cc7ed8
unverified

winglian commited on

Fix typo
52aada7
unverified

Nanobit commited on

Merge pull request #26 from OpenAccess-AI-Collective/mpt-triton
688c73a
unverified

winglian commited on

black formatting
2bc1a5b

winglian commited on

various fixes
7a490a4

winglian commited on

Fix Trainer() got multiple values for keyword argument 'callbacks'
813aab3
unverified

Nanobit commited on

testing mpt triton
e2e68c3

winglian commited on

fix conditional so alpaca doesn't choke
a27d594

winglian commited on

Merge pull request #23 from NanoCode012/patch-1
1fb0376
unverified

winglian commited on

Update finetune.py
915c56c
unverified

winglian commited on

not everyone has bf16 available
df9c508

winglian commited on

add 4bit lora 7b
7967cd1

winglian commited on

Don't save full model for lora
cd23959
unverified

Nanobit commited on

Save adapter for lora
71a1f7f
unverified

Nanobit commited on

push up redpajama 3b example
02c5983

winglian commited on

Merge pull request #15 from NanoCode012/feat/completion
3f9c953
unverified

winglian commited on

Rename variable to use same convention
174b74d

Nanobit commited on

Add CompletionPrompt type
cf68153

Nanobit commited on

Merge pull request #21 from NanoCode012/patch-1
bd3c5a5
unverified

winglian commited on

Merge pull request #19 from NanoCode012/feat/callback-save-lora
bcbc99e
unverified

winglian commited on

Merge pull request #22 from NanoCode012/patch-2
b0d2594
unverified

winglian commited on

Fix BNB OOM by pinning version
fe582df
unverified

Nanobit commited on

Update trainer.py
36aaea0
unverified

Nanobit commited on

Fix condition scheduler
5b6690a
unverified

Nanobit commited on

add support for trust_remote_code for mpt models
a125693

winglian commited on

use printf instead of echo in dockerfile for portability
709be5a

winglian commited on