Spaces:

Dovakiins
/

qwerrwe

Build error

App Files Files Community

qwerrwe / src

Commit History

misc fixes to add gptq tests (#621)

03e5907
unverified

winglian commited on Sep 22, 2023

split completion text to sequence_len (#616)

97d3776
unverified

winglian commited on Sep 22, 2023

run eval on the first step to get a baseline (#617)

2844eb2
unverified

winglian commited on Sep 22, 2023

skip the gpu memory checks if the device is set to 'auto' (#609)

196ff11
unverified

winglian commited on Sep 21, 2023

fix distributed devices (#612)

2fe95cd
unverified

Maxime commited on Sep 21, 2023

support to disable exllama for gptq (#604)

faecff9
unverified

winglian commited on Sep 19, 2023

Delete duplicate lines (#606)

aa656e0
unverified

bofenghuang commited on Sep 19, 2023

improve handling for empty text on the tokenization step (#502)

1eebbd0
unverified

winglian commited on Sep 19, 2023

Fix for check with cfg and merge_lora (#600)

62a7741
unverified

winglian commited on Sep 19, 2023

minor tweaks to simplify (#597)

31b9e0c
unverified

winglian commited on Sep 18, 2023

btlm and falcon monkey patches for flash attn (#566)

6b9b229
unverified

winglian commited on Sep 17, 2023

add bf16 check (#587)

131afdb
unverified

winglian commited on Sep 17, 2023

Feat(data): Allow loading local csv and text (#594)

00dce35
unverified

Nanobit commited on Sep 17, 2023

gather/broadcast the max value of the packing efficiency automatically (#463)

b15b19e
unverified

winglian commited on Sep 17, 2023

don't add position_ids for evals (#591)

ab534d7
unverified

winglian commited on Sep 16, 2023

optionally configure sample packing for evals (#589)

21ec195
unverified

winglian commited on Sep 16, 2023

make phi training work with Loras (#588)

62eaee7
unverified

winglian commited on Sep 16, 2023

set fsdp state dict (#584)

be75668
unverified

Jan Philipp Harries Jan Philipp Harries commited on Sep 15, 2023

pop block_cls since it's not an actual kwarg

aeec7c4

winglian commited on Sep 15, 2023

don't resize embeddings if it's already large enough (#577)

3607882
unverified

winglian commited on Sep 15, 2023

Support Sample packing for phi arch (#586)

12a2dbb
unverified

winglian commited on Sep 15, 2023

support custom field for completion from yml (#580)

f7a2263
unverified

winglian commited on Sep 15, 2023

prevent cli functions from getting fired on import (#581)

8dcd40a
unverified

winglian commited on Sep 15, 2023

refactor scripts/finetune.py into new cli modules (#550)

861ceca
unverified

Nanobit commited on Sep 15, 2023

E2e device cuda (#575)

2414673
unverified

winglian commited on Sep 15, 2023

mypy wandb ignore (#572)

c6d870b
unverified

winglian commited on Sep 14, 2023

remove columns after tokenizing for pretraining (#571)

1157950
unverified

winglian commited on Sep 14, 2023

fix save_steps so it doesn't get duplicated (#567)

3fbde76
unverified

winglian commited on Sep 14, 2023

Model parallel (#538)

f6060a6
unverified

winglian commited on Sep 13, 2023

let hf trainer handle torch compile (#516)

a4e1bb6
unverified

tmm1 commited on Sep 13, 2023

improve how we setup eval/save strategies and steps (#547)

36e53c7
unverified

winglian commited on Sep 13, 2023

gracefully handle length feature used for group by (#565)

e7aa7b1
unverified

winglian commited on Sep 13, 2023

add optimization for group-by-len (#563)

e5bb22a
unverified

winglian commited on Sep 13, 2023

fix wandb so mypy doesn't complain (#562)

bf08044
unverified

winglian commited on Sep 13, 2023

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

Glavin001 commited on Sep 13, 2023

Fix pretraining with iterable/streaming Dataset (#556)

2f586d1
unverified

Jan Philipp Harries Jan Philipp Harries commited on Sep 13, 2023

fix for quant config from model (#540)

a94f9cb
unverified

winglian commited on Sep 10, 2023

workaround for md5 variations (#533)

0b4cf5b
unverified

winglian commited on Sep 8, 2023

Early stopping metric (#537)

e30f1e3
unverified

winglian commited on Sep 8, 2023

recommend padding when using sample packing (#531)

3437149
unverified

winglian commited on Sep 6, 2023

log rank too (#527)

245c5c4
unverified

winglian commited on Sep 6, 2023

misc fixes/improvements (#513)

a546ca2
unverified

winglian commited on Sep 5, 2023

Add support for GPTQ using native transformers/peft (#468)

3355706
unverified

winglian commited on Sep 5, 2023

Merge pull request #520 from bdashore3/sharegpt-fixes

daa4fac
unverified

mhenrichsen commited on Sep 5, 2023

reorg a bit

fc8766e

tmm1 commited on Sep 5, 2023

use flash_attn rmsnorm when available (#526)

72a6fe1
unverified

tmm1 commited on Sep 4, 2023

use flash_attn xentropy when available (#525)

5fe30b1
unverified

tmm1 commited on Sep 4, 2023

move is_llama_derived_model into normalize_config (#524)

44454ae
unverified

tmm1 commited on Sep 4, 2023

No gather single gpu (#523)

09f1543
unverified

winglian commited on Sep 4, 2023

Prompters: ShareGPT: Allow for custom system prompts

995557b

kingbri commited on Sep 1, 2023