move is_llama_derived_model into normalize_config (#524) 44454ae unverified tmm1 committed on Sep 4, 2023
Debug tokenization output: add ability to output text only (no tokens) and/or specify the number of samples to see (#511) 48434be unverified Tom Jobbins committed on Aug 31, 2023
tweak: use default config file when only one file is present (#501) 36b2e1c unverified Maxime committed on Aug 29, 2023
fix: inference did not move the model to the correct device (#483) 17605b8 unverified Maxime committed on Aug 26, 2023
ReLoRA implementation (with quantization) (#322) bde3c5a unverified chargoddard winglian committed on Aug 24, 2023
use context manager to run things on rank0 before others (#397) fc2d6be unverified winglian committed on Aug 15, 2023
Attention mask and position id fixes for packing (#285) 2bb0b78 unverified winglian committed on Aug 12, 2023
Fixed pre-commit problems; fixed a small bug in logging_config to handle the LOG_LEVEL env var b1f4f7a theobjectivedad committed on Jul 15, 2023
Merge pull request #92 from OpenAccess-AI-Collective/flash-optimum 16bb627 unverified winglian committed on Jun 14, 2023
Merge pull request #177 from NanoCode012/fix/landmark-patch 8002ffb unverified winglian committed on Jun 12, 2023
add flash attn context for efficient training and attempt to set model to train mode 8792199 winglian committed on May 27, 2023
Remove explicit definition of cfg.inference c250898 unverified Angainor Development committed on Jun 10, 2023
new prompters; misc fixes for missing output dir when using FSDP; change max seq len 4ac9e25 winglian committed on Jun 6, 2023