Commits · Dovakiins/qwerrwe

add streaming dataset support for pretraining datasets

eea2731

winglian commited on Jun 10, 2023

more tweaks to do pre-training with bettertransformers

1210dc8

winglian commited on Jun 1, 2023

experimental expansion of ctx len

488a67d

winglian commited on May 31, 2023

add flash attn context for efficient training and attempt setting model to train mode:

8792199

winglian commited on May 27, 2023

add support for opimum bettertransformers

1edc30c

winglian commited on May 27, 2023

Merge branch 'main' into patch-1

79e2a6f
unverified

Angainor Development commited on Jun 10, 2023

Remove explicit definition of cfg.inference

c250898
unverified

Angainor Development commited on Jun 10, 2023

formatting for linter

f36e227
unverified

winglian commited on Jun 10, 2023

Add streaming inference & fix stopping at EOS

fec6bcc

Glavin001 commited on Jun 10, 2023

Feed cfg.inference

bd3b537
unverified

Angainor Development commited on Jun 9, 2023

Set matmul tf32

52765ac

Nanobit commited on Jun 8, 2023

new prompters, misc fixes for output dir missing using fsdp, and changing max seq len

4ac9e25

winglian commited on Jun 6, 2023

fix device map

74ebbf4

winglian commited on Jun 2, 2023

fix batch size calculation

5a631b3

winglian commited on May 31, 2023

Merge pull request #119 from NanoCode012/feat/update-inference

fac4600
unverified

Nanobit commited on May 31, 2023

Increase max_new_tokens

33d4017
unverified

Nanobit

winglian commited on May 31, 2023

Merge pull request #120 from OpenAccess-AI-Collective/model-from-path

c7021e1
unverified

winglian commited on May 31, 2023

black formatting

6fa40bf

winglian commited on May 31, 2023

add support for gradient accumulation steps

3aad5f3

winglian commited on May 31, 2023

fix up tokenizer config, isort fix

39a208c

winglian commited on May 31, 2023

Feat: Swap to GenerationConfig

988aeb9

Nanobit commited on May 31, 2023

Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq

bbc5bc5
unverified

winglian commited on May 30, 2023

Fix security issue or ignore false positives

a1f9850

Nanobit commited on May 29, 2023

Apply isort then black

37293dc

Nanobit commited on May 29, 2023

Delete extract_lora.py

96e8378

Nanobit commited on May 29, 2023

Fix mypy typing

e9650d3

Nanobit commited on May 29, 2023

Lint finetune.py

82971e1

Nanobit commited on May 29, 2023

Lint and format

392dfd9

Nanobit commited on May 28, 2023

default to qlora support, make gptq specific image

6ef96f5

winglian commited on May 30, 2023

bnb fixes

21f17cc

winglian commited on May 29, 2023

refactor: fix previous refactors

56f9ca5

Nanobit commited on May 28, 2023

Refactor to use DictDefault instead

8bd7a49

Nanobit commited on May 28, 2023

Fix load error

93acb64

Nanobit commited on May 27, 2023

Convert attrdict to addict

bdfe7c9

Nanobit commited on May 27, 2023

move list not in list logic to fn

cc67862

winglian commited on May 27, 2023

load the tokenizer seperately from the model

32e6fe9

winglian commited on May 26, 2023

add logging and make sure model unloads to float16

a5bf838

winglian commited on May 26, 2023

remove un-needed code, add validation

1f5d83e

winglian commited on May 25, 2023

Update scripts/finetune.py

3457810
unverified

winglian

Nanobit commited on May 22, 2023

Update scripts/finetune.py for logging

ae1719d
unverified

winglian

Nanobit commited on May 22, 2023

optionally be able to specify alpaca or chat style prompts

1d5ab84

winglian commited on May 20, 2023

add alpaca multiple choice instruct dataset support

b46bc02

winglian commited on May 17, 2023

reorder options so debug can happen in the same prepare step

f98e173

winglian commited on May 16, 2023

more fixes

bdbca8f

winglian commited on May 15, 2023

move filter to before saving so it doesn't happen everytime, update runpod manual script

0d28df0

winglian commited on May 14, 2023

Fix typo

52aada7
unverified

Nanobit commited on May 11, 2023

black formatting

2bc1a5b

winglian commited on May 10, 2023

Update finetune.py

915c56c
unverified

winglian commited on May 9, 2023

Don't save full model for lora

cd23959
unverified

Nanobit commited on May 9, 2023

Save adapter for lora

71a1f7f
unverified

Nanobit commited on May 9, 2023

Commit History

add streaming dataset support for pretraining datasets eea2731

more tweaks to do pre-training with bettertransformers 1210dc8

experimental expansion of ctx len 488a67d

add flash attn context for efficient training and attempt setting model to train mode: 8792199

add support for opimum bettertransformers 1edc30c

Merge branch 'main' into patch-1 79e2a6f unverified

Remove explicit definition of cfg.inference c250898 unverified

formatting for linter f36e227 unverified

Add streaming inference & fix stopping at EOS fec6bcc

Feed cfg.inference bd3b537 unverified

Set matmul tf32 52765ac

new prompters, misc fixes for output dir missing using fsdp, and changing max seq len 4ac9e25

fix device map 74ebbf4

fix batch size calculation 5a631b3

Merge pull request #119 from NanoCode012/feat/update-inference fac4600 unverified

Increase max_new_tokens 33d4017 unverified

Merge pull request #120 from OpenAccess-AI-Collective/model-from-path c7021e1 unverified

black formatting 6fa40bf

add support for gradient accumulation steps 3aad5f3

fix up tokenizer config, isort fix 39a208c

Feat: Swap to GenerationConfig 988aeb9

Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq bbc5bc5 unverified

Fix security issue or ignore false positives a1f9850

Apply isort then black 37293dc

Delete extract_lora.py 96e8378

Fix mypy typing e9650d3

Lint finetune.py 82971e1

Lint and format 392dfd9

default to qlora support, make gptq specific image 6ef96f5

bnb fixes 21f17cc

refactor: fix previous refactors 56f9ca5

Refactor to use DictDefault instead 8bd7a49

Fix load error 93acb64

Convert attrdict to addict bdfe7c9

move list not in list logic to fn cc67862

load the tokenizer seperately from the model 32e6fe9

add logging and make sure model unloads to float16 a5bf838

remove un-needed code, add validation 1f5d83e

Update scripts/finetune.py 3457810 unverified

Update scripts/finetune.py for logging ae1719d unverified

optionally be able to specify alpaca or chat style prompts 1d5ab84

add alpaca multiple choice instruct dataset support b46bc02

reorder options so debug can happen in the same prepare step f98e173

more fixes bdbca8f

move filter to before saving so it doesn't happen everytime, update runpod manual script 0d28df0

Fix typo 52aada7 unverified

black formatting 2bc1a5b

Update finetune.py 915c56c unverified

Don't save full model for lora cd23959 unverified

Save adapter for lora 71a1f7f unverified

add streaming dataset support for pretraining datasets

eea2731

more tweaks to do pre-training with bettertransformers

1210dc8

experimental expansion of ctx len

488a67d

add flash attn context for efficient training and attempt setting model to train mode:

8792199

add support for opimum bettertransformers

1edc30c

Merge branch 'main' into patch-1

79e2a6f
unverified

Remove explicit definition of cfg.inference

c250898
unverified

formatting for linter

f36e227
unverified

Add streaming inference & fix stopping at EOS

fec6bcc

Feed cfg.inference

bd3b537
unverified

Set matmul tf32

52765ac

new prompters, misc fixes for output dir missing using fsdp, and changing max seq len

4ac9e25

fix device map

74ebbf4

fix batch size calculation

5a631b3

Merge pull request #119 from NanoCode012/feat/update-inference

fac4600
unverified

Increase max_new_tokens

33d4017
unverified

Merge pull request #120 from OpenAccess-AI-Collective/model-from-path

c7021e1
unverified

black formatting

6fa40bf

add support for gradient accumulation steps

3aad5f3

fix up tokenizer config, isort fix

39a208c

Feat: Swap to GenerationConfig

988aeb9

Merge pull request #108 from OpenAccess-AI-Collective/docker-gptq

bbc5bc5
unverified

Fix security issue or ignore false positives

a1f9850

Apply isort then black

37293dc

Delete extract_lora.py

96e8378

Fix mypy typing

e9650d3

Lint finetune.py

82971e1

Lint and format

392dfd9

default to qlora support, make gptq specific image

6ef96f5

bnb fixes

21f17cc

refactor: fix previous refactors

56f9ca5

Refactor to use DictDefault instead

8bd7a49

Fix load error

93acb64

Convert attrdict to addict

bdfe7c9

move list not in list logic to fn

cc67862

load the tokenizer seperately from the model

32e6fe9

add logging and make sure model unloads to float16

a5bf838

remove un-needed code, add validation

1f5d83e

Update scripts/finetune.py

3457810
unverified

Update scripts/finetune.py for logging

ae1719d
unverified

optionally be able to specify alpaca or chat style prompts

1d5ab84

add alpaca multiple choice instruct dataset support

b46bc02

reorder options so debug can happen in the same prepare step

f98e173

more fixes

bdbca8f

move filter to before saving so it doesn't happen everytime, update runpod manual script

0d28df0

Fix typo

52aada7
unverified

black formatting

2bc1a5b

Update finetune.py

915c56c
unverified

Don't save full model for lora

cd23959
unverified

Save adapter for lora

71a1f7f
unverified