Commits · Dovakiins/qwerrwe

fix new dataset prompt tokenizers

0f74464

winglian commited on May 21, 2023

add missing init

e0602a9

winglian commited on May 21, 2023

pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too

2809f3f

winglian commited on May 21, 2023

tokenization fixes

4ea9a66

winglian commited on May 21, 2023

Merge pull request #32 from NanoCode012/patch-2

ed37b22
unverified

winglian commited on May 20, 2023

optionally be able to specify alpaca or chat style prompts

1d5ab84

winglian commited on May 20, 2023

Set `half` using `cfg.fp16` for 4bit

641f801
unverified

Nanobit commited on May 19, 2023

update entrypoint and force min accelerate

fa8bd14

winglian commited on May 18, 2023

concise multiple choice and tldr summarize

1365073

winglian commited on May 17, 2023

support for replit lm

8c2f3cb

winglian commited on May 17, 2023

add alpaca multiple choice instruct dataset support

b46bc02

winglian commited on May 17, 2023

Merge pull request #29 from NanoCode012/patch-1

e553c90
unverified

winglian commited on May 16, 2023

Add `lora_modules_to_save`

2c73c81
unverified

Nanobit commited on May 16, 2023

reorder options so debug can happen in the same prepare step

f98e173

winglian commited on May 16, 2023

fix prompters, especially the sharegpt prompter

5e37144

winglian commited on May 16, 2023

more fixes

bdbca8f

winglian commited on May 15, 2023

more fixes

42410c7

winglian commited on May 14, 2023

fix torch_dtype for model load

aef00b6

winglian commited on May 14, 2023

move filter to before saving so it doesn't happen everytime, update runpod manual script

0d28df0

winglian commited on May 14, 2023

whoops, gt vs lt

84c7bc4

winglian commited on May 12, 2023

optimize dataloading to use cache, fix model token embedding sizes

aa3c3f9

winglian commited on May 12, 2023

Merge pull request #25 from NanoCode012/patch-2

f6d1fa4
unverified

winglian commited on May 11, 2023

Merge branch 'main' into patch-2

89b7f26
unverified

Nanobit commited on May 11, 2023

fix config for parity with previous change

165da58

winglian commited on May 11, 2023

Merge pull request #27 from NanoCode012/patch-1

4cc7ed8
unverified

winglian commited on May 11, 2023

Fix typo

52aada7
unverified

Nanobit commited on May 11, 2023

Merge pull request #26 from OpenAccess-AI-Collective/mpt-triton

688c73a
unverified

winglian commited on May 10, 2023

black formatting

2bc1a5b

winglian commited on May 10, 2023

various fixes

7a490a4

winglian commited on May 10, 2023

Fix Trainer() got multiple values for keyword argument 'callbacks'

813aab3
unverified

Nanobit commited on May 10, 2023

testing mpt triton

e2e68c3

winglian commited on May 10, 2023

fix conditional so alpaca doesn't choke

a27d594

winglian commited on May 10, 2023

Merge pull request #23 from NanoCode012/patch-1

1fb0376
unverified

winglian commited on May 9, 2023

Update finetune.py

915c56c
unverified

winglian commited on May 9, 2023

not everyone has bf16 available

df9c508

winglian commited on May 9, 2023

add 4bit lora 7b

7967cd1

winglian commited on May 9, 2023

Don't save full model for lora

cd23959
unverified

Nanobit commited on May 9, 2023

Save adapter for lora

71a1f7f
unverified

Nanobit commited on May 9, 2023

push up redpajama 3b example

02c5983

winglian commited on May 8, 2023

Merge pull request #15 from NanoCode012/feat/completion

3f9c953
unverified

winglian commited on May 8, 2023

Rename variable to use same convention

174b74d

Nanobit commited on May 8, 2023

Add CompletionPrompt type

cf68153

Nanobit commited on May 8, 2023

Merge pull request #21 from NanoCode012/patch-1

bd3c5a5
unverified

winglian commited on May 8, 2023

Merge pull request #19 from NanoCode012/feat/callback-save-lora

bcbc99e
unverified

winglian commited on May 8, 2023

Merge pull request #22 from NanoCode012/patch-2

b0d2594
unverified

winglian commited on May 8, 2023

Fix BNB OOM by pinning version

fe582df
unverified

Nanobit commited on May 8, 2023

Update trainer.py

36aaea0
unverified

Nanobit commited on May 8, 2023

Fix condition scheduler

5b6690a
unverified

Nanobit commited on May 8, 2023

add support for trust_remote_code for mpt models

a125693

winglian commited on May 8, 2023

use printf instead of echo in dockerfile for portability

709be5a

winglian commited on May 8, 2023

Commit History

fix new dataset prompt tokenizers 0f74464

add missing __init__ e0602a9

pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too 2809f3f

tokenization fixes 4ea9a66

Merge pull request #32 from NanoCode012/patch-2 ed37b22 unverified

optionally be able to specify alpaca or chat style prompts 1d5ab84

Set `half` using `cfg.fp16` for 4bit 641f801 unverified

update entrypoint and force min accelerate fa8bd14

concise multiple choice and tldr summarize 1365073

support for replit lm 8c2f3cb

add alpaca multiple choice instruct dataset support b46bc02

Merge pull request #29 from NanoCode012/patch-1 e553c90 unverified

Add `lora_modules_to_save` 2c73c81 unverified

reorder options so debug can happen in the same prepare step f98e173

fix prompters, especially the sharegpt prompter 5e37144

more fixes bdbca8f

more fixes 42410c7

fix torch_dtype for model load aef00b6

move filter to before saving so it doesn't happen everytime, update runpod manual script 0d28df0

whoops, gt vs lt 84c7bc4

optimize dataloading to use cache, fix model token embedding sizes aa3c3f9

Merge pull request #25 from NanoCode012/patch-2 f6d1fa4 unverified

Merge branch 'main' into patch-2 89b7f26 unverified

fix config for parity with previous change 165da58

Merge pull request #27 from NanoCode012/patch-1 4cc7ed8 unverified

Fix typo 52aada7 unverified

Merge pull request #26 from OpenAccess-AI-Collective/mpt-triton 688c73a unverified

black formatting 2bc1a5b

various fixes 7a490a4

Fix Trainer() got multiple values for keyword argument 'callbacks' 813aab3 unverified

testing mpt triton e2e68c3

fix conditional so alpaca doesn't choke a27d594

Merge pull request #23 from NanoCode012/patch-1 1fb0376 unverified

Update finetune.py 915c56c unverified

not everyone has bf16 available df9c508

add 4bit lora 7b 7967cd1

Don't save full model for lora cd23959 unverified

Save adapter for lora 71a1f7f unverified

push up redpajama 3b example 02c5983

Merge pull request #15 from NanoCode012/feat/completion 3f9c953 unverified

Rename variable to use same convention 174b74d

Add CompletionPrompt type cf68153

Merge pull request #21 from NanoCode012/patch-1 bd3c5a5 unverified

Merge pull request #19 from NanoCode012/feat/callback-save-lora bcbc99e unverified

Merge pull request #22 from NanoCode012/patch-2 b0d2594 unverified

Fix BNB OOM by pinning version fe582df unverified

Update trainer.py 36aaea0 unverified

Fix condition scheduler 5b6690a unverified

add support for trust_remote_code for mpt models a125693

use printf instead of echo in dockerfile for portability 709be5a

fix new dataset prompt tokenizers

0f74464

add missing init

e0602a9

pygmalion dataset prompts format, cached tokenized datasets should be hashed on the tokenizer too

2809f3f

tokenization fixes

4ea9a66

Merge pull request #32 from NanoCode012/patch-2

ed37b22
unverified

optionally be able to specify alpaca or chat style prompts

1d5ab84

Set `half` using `cfg.fp16` for 4bit

641f801
unverified

update entrypoint and force min accelerate

fa8bd14

concise multiple choice and tldr summarize

1365073

support for replit lm

8c2f3cb

add alpaca multiple choice instruct dataset support

b46bc02

Merge pull request #29 from NanoCode012/patch-1

e553c90
unverified

Add `lora_modules_to_save`

2c73c81
unverified

reorder options so debug can happen in the same prepare step

f98e173

fix prompters, especially the sharegpt prompter

5e37144

more fixes

bdbca8f

more fixes

42410c7

fix torch_dtype for model load

aef00b6

move filter to before saving so it doesn't happen everytime, update runpod manual script

0d28df0

whoops, gt vs lt

84c7bc4

optimize dataloading to use cache, fix model token embedding sizes

aa3c3f9

Merge pull request #25 from NanoCode012/patch-2

f6d1fa4
unverified

Merge branch 'main' into patch-2

89b7f26
unverified

fix config for parity with previous change

165da58

Merge pull request #27 from NanoCode012/patch-1

4cc7ed8
unverified

Fix typo

52aada7
unverified

Merge pull request #26 from OpenAccess-AI-Collective/mpt-triton

688c73a
unverified

black formatting

2bc1a5b

various fixes

7a490a4

Fix Trainer() got multiple values for keyword argument 'callbacks'

813aab3
unverified

testing mpt triton

e2e68c3

fix conditional so alpaca doesn't choke

a27d594

Merge pull request #23 from NanoCode012/patch-1

1fb0376
unverified

Update finetune.py

915c56c
unverified

not everyone has bf16 available

df9c508

add 4bit lora 7b

7967cd1

Don't save full model for lora

cd23959
unverified

Save adapter for lora

71a1f7f
unverified

push up redpajama 3b example

02c5983

Merge pull request #15 from NanoCode012/feat/completion

3f9c953
unverified

Rename variable to use same convention

174b74d

Add CompletionPrompt type

cf68153

Merge pull request #21 from NanoCode012/patch-1

bd3c5a5
unverified

Merge pull request #19 from NanoCode012/feat/callback-save-lora

bcbc99e
unverified

Merge pull request #22 from NanoCode012/patch-2

b0d2594
unverified

Fix BNB OOM by pinning version

fe582df
unverified

Update trainer.py

36aaea0
unverified

Fix condition scheduler

5b6690a
unverified

add support for trust_remote_code for mpt models

a125693

use printf instead of echo in dockerfile for portability

709be5a