Commit History

hanging slash typo
17345c8

winglian committed

build on self hosted GPU runners
9cd5d3f

winglian committed

docker layer caching, build w axolotl from base build
990bec6

winglian committed

typo in git repo for pip
0c46806

winglian committed

add huggingface packages and awscli
66fa751

winglian committed

fix typo and add apex
21b7439

winglian committed

needs libaio-dev from apt
3f11b47

winglian committed

pip install packaging dep
ece46b2

winglian committed

build dependencies and aws-cli
92d800a

winglian committed

build base separately
2734e3f

winglian committed

build base too
14ebd2e

winglian committed

fix push to docker hub
4a79dab

winglian committed

fix whitespace and instruction on inference
47ad389

winglian committed

push to docker hub
76b24bc

winglian committed

TORCH_CUDA_ARCH_LIST should be an ARG
73450d9

winglian committed

run this on self hosted runner for now
97cf778

winglian committed

runs on larger git runner?
e2599ed

winglian committed

don't push the image
75bc856

winglian committed

run on git commit
15bdbae

winglian committed

try docker build on gitlab
6603b37

winglian committed

build dockerfile in gha
2634689

winglian committed

update stablelm config
4818380

winglian committed

refactor inference, warn if model is frozen
247825b

winglian committed

Merge pull request #13 from winglian/dev
cb9a887
unverified

winglian committed

Merge pull request #12 from NanoCode012/feat/eval_config
a15d823
unverified

winglian committed

Add eval_batch_size for evaluation
0e74b64

Nanobit committed

fix log sweep lr
a10a826

winglian committed

support for multi line inference input, log sweep over learning rates
9105935

winglian committed

fix adam bnb optimizer grouped parameters, fix peft model 8bit conversion logic, black formatting
7748f3d

winglian committed

install peft from main branch
fe9c29d

winglian committed

support llama-adapter zero init attention
2255bb7

winglian committed

use prebuilt wheels for flash-attn and deepspeed
55baef0

winglian committed

fdsp config dict fix, todo list, add torchdistx support
ad2b48c

winglian committed

8bit and deepspeed changes
9190ada

winglian committed

update ds_config
4dbef09

winglian committed

don't load models in 8bit unless they are using an adapter, also fix tokenizer load in exceptional case
6dfdd2d

winglian committed

fix fsdp training args
29936bb

winglian committed

fix for zero value warmup steps
7882181

winglian committed

fix sharegpt tokenization, refactor tokenization debugging
5159d00

winglian committed

wire up gradient checkpointing for 4bit
c0f50d9

winglian committed

Merge pull request #9 from winglian/dev
4e705ed
unverified

winglian committed

fix dataset handling, support galactica
4a17a4c

winglian committed

tweaks to data loading, 8 bit adam, accelerate and deepspeed
097d367

winglian committed

shuffle and split dataset after save/load
4f2584f

winglian committed

fix sharegpt handling from hf, don't worry about loading llama if using earlier transformers release
8d43785

winglian committed

stablelm support
8e2a560

winglian committed

various bugfixes
94f5e41

winglian committed

ignore config, add python 3.9 (#8)
2624bc2
unverified

ehartford committed

fix bug when model_type not explicitly passed
bb991fd

winglian committed

improve inference
d653859

winglian committed