Commits · Dovakiins/qwerrwe

Merge pull request #182 from OpenAccess-AI-Collective/fix-llama-ref

0124825
unverified

winglian commited on Jun 10, 2023

address PR feedback

0c6f928

winglian commited on Jun 10, 2023

add streaming dataset support for pretraining datasets

eea2731

winglian commited on Jun 10, 2023

more gpt-neox long ctx fixes

ab5cd28

winglian commited on Jun 1, 2023

fix bettertransformers save, force it to skip after saving correctly in callback

1a82082

winglian commited on Jun 1, 2023

more tweaks to do pre-training with bettertransformers

1210dc8

winglian commited on Jun 1, 2023

experimental expansion of ctx len

488a67d

winglian commited on May 31, 2023

add validation/warning for bettertransformers and torch version

71a43f8

winglian commited on May 28, 2023

add support for opimum bettertransformers

1edc30c

winglian commited on May 27, 2023

fix for local variable 'LlamaForCausalLM' referenced before assignment

14163c1

winglian commited on Jun 10, 2023

Merge branch 'main' into patch-1

79e2a6f
unverified

Angainor Development commited on Jun 10, 2023

add support to extend context with xpos rope

a03a7d7

winglian commited on Jun 10, 2023

fix for max sequence len across different model types

7f09106

winglian commited on Jun 10, 2023

Fix backward compat for peft

aefb2fc

Nanobit commited on Jun 9, 2023

WIP: Rely on cfg.inference

813cfa4
unverified

Angainor Development commited on Jun 9, 2023

Fix grad checkpoint and outputs param

2a801b0

Nanobit commited on Jun 9, 2023

Fix patching via import instead of hijacking

e44c9e0

Nanobit commited on Jun 9, 2023

Feat: Add landmark attention

55b8542

Nanobit commited on Jun 9, 2023

Disable Wandb

f4df266

Bruno Cabral commited on Jun 9, 2023

Refactor out unmodified save_steps and eval_steps

2ef4634

Nanobit commited on Jun 8, 2023

Set to use cfg.seed or 42 for backward compat

2cfe9e9

Nanobit commited on Jun 8, 2023

Fix failing test

bfd27ba

Nanobit commited on Jun 8, 2023

Validate falcon with fsdp

babf0fd

Nanobit commited on Jun 8, 2023

Fix future deprecate prepare_model_for_int8_training

df9528f

Nanobit commited on Jun 2, 2023

Fix training over existing lora

193c73b
unverified

Angainor Development commited on Jun 8, 2023

fix camel ai, add guanaco/oasst mapping for sharegpt

59bb219

winglian commited on Jun 7, 2023

new prompters, misc fixes for output dir missing using fsdp, and changing max seq len

4ac9e25

winglian commited on Jun 6, 2023

Update doc for grad_accu and add validation tests for batch size

3c71c8d

Nanobit commited on May 31, 2023

fix batch size calculation

5a631b3

winglian commited on May 31, 2023

fix packing so that concatenated sequences reset the attention

9b8585d

winglian commited on May 31, 2023

Merge pull request #124 from OpenAccess-AI-Collective/xformers-fix

2d0ba3b
unverified

winglian commited on May 31, 2023

Merge pull request #120 from OpenAccess-AI-Collective/model-from-path

c7021e1
unverified

winglian commited on May 31, 2023

don't worry about dupes

c56818b

winglian commited on May 31, 2023

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

1076bcb
unverified

winglian

Nanobit commited on May 31, 2023

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

2daa683
unverified

winglian

Nanobit commited on May 31, 2023

remove unused import and update readme

e3c494c

winglian commited on May 31, 2023

black formatting

ad0ea6a

winglian commited on May 31, 2023

copy xformers attn from ooba since we removed dep on alpaca_lora_4bit

6cb2310

winglian commited on May 31, 2023

add support for gradient accumulation steps

3aad5f3

winglian commited on May 31, 2023

fix up tokenizer config, isort fix

39a208c

winglian commited on May 31, 2023

split up llama model loading so config can be loaded from base config and models can be loaded from a path

2520ecd

winglian commited on May 31, 2023

Fix incorrect rebase

594e72b

Nanobit commited on May 30, 2023

Fix sharegpt prompt

25eeeeb

Nanobit commited on May 30, 2023

fix relative path for fixtures

cfcc549

winglian commited on May 30, 2023

Fix security issue or ignore false positives

a1f9850

Nanobit commited on May 29, 2023

Update src/axolotl/prompt_strategies/alpaca_instruct.py

c17dae6

Nanobit

winglian commited on May 29, 2023

Apply isort then black

37293dc

Nanobit commited on May 29, 2023

Fix mypy typing

e9650d3

Nanobit commited on May 29, 2023

Fix unsupported operand type(s) for |

be22551

Nanobit commited on May 29, 2023

Black formatting

b832a0a

Nanobit commited on May 29, 2023

Commit History

Merge pull request #182 from OpenAccess-AI-Collective/fix-llama-ref 0124825 unverified

address PR feedback 0c6f928

add streaming dataset support for pretraining datasets eea2731

more gpt-neox long ctx fixes ab5cd28

fix bettertransformers save, force it to skip after saving correctly in callback 1a82082

more tweaks to do pre-training with bettertransformers 1210dc8

experimental expansion of ctx len 488a67d

add validation/warning for bettertransformers and torch version 71a43f8

add support for opimum bettertransformers 1edc30c

fix for local variable 'LlamaForCausalLM' referenced before assignment 14163c1

Merge branch 'main' into patch-1 79e2a6f unverified

add support to extend context with xpos rope a03a7d7

fix for max sequence len across different model types 7f09106

Fix backward compat for peft aefb2fc

WIP: Rely on cfg.inference 813cfa4 unverified

Fix grad checkpoint and outputs param 2a801b0

Fix patching via import instead of hijacking e44c9e0

Feat: Add landmark attention 55b8542

Disable Wandb f4df266

Refactor out unmodified save_steps and eval_steps 2ef4634

Set to use cfg.seed or 42 for backward compat 2cfe9e9

Fix failing test bfd27ba

Validate falcon with fsdp babf0fd

Fix future deprecate prepare_model_for_int8_training df9528f

Fix training over existing lora 193c73b unverified

fix camel ai, add guanaco/oasst mapping for sharegpt 59bb219

new prompters, misc fixes for output dir missing using fsdp, and changing max seq len 4ac9e25

Update doc for grad_accu and add validation tests for batch size 3c71c8d

fix batch size calculation 5a631b3

fix packing so that concatenated sequences reset the attention 9b8585d

Merge pull request #124 from OpenAccess-AI-Collective/xformers-fix 2d0ba3b unverified

Merge pull request #120 from OpenAccess-AI-Collective/model-from-path c7021e1 unverified

don't worry about dupes c56818b

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py 1076bcb unverified

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py 2daa683 unverified

remove unused import and update readme e3c494c

black formatting ad0ea6a

copy xformers attn from ooba since we removed dep on alpaca_lora_4bit 6cb2310

add support for gradient accumulation steps 3aad5f3

fix up tokenizer config, isort fix 39a208c

split up llama model loading so config can be loaded from base config and models can be loaded from a path 2520ecd

Fix incorrect rebase 594e72b

Fix sharegpt prompt 25eeeeb

fix relative path for fixtures cfcc549

Fix security issue or ignore false positives a1f9850

Update src/axolotl/prompt_strategies/alpaca_instruct.py c17dae6

Apply isort then black 37293dc

Fix mypy typing e9650d3

Fix unsupported operand type(s) for | be22551

Black formatting b832a0a

Merge pull request #182 from OpenAccess-AI-Collective/fix-llama-ref

0124825
unverified

address PR feedback

0c6f928

add streaming dataset support for pretraining datasets

eea2731

more gpt-neox long ctx fixes

ab5cd28

fix bettertransformers save, force it to skip after saving correctly in callback

1a82082

more tweaks to do pre-training with bettertransformers

1210dc8

experimental expansion of ctx len

488a67d

add validation/warning for bettertransformers and torch version

71a43f8

add support for opimum bettertransformers

1edc30c

fix for local variable 'LlamaForCausalLM' referenced before assignment

14163c1

Merge branch 'main' into patch-1

79e2a6f
unverified

add support to extend context with xpos rope

a03a7d7

fix for max sequence len across different model types

7f09106

Fix backward compat for peft

aefb2fc

WIP: Rely on cfg.inference

813cfa4
unverified

Fix grad checkpoint and outputs param

2a801b0

Fix patching via import instead of hijacking

e44c9e0

Feat: Add landmark attention

55b8542

Disable Wandb

f4df266

Refactor out unmodified save_steps and eval_steps

2ef4634

Set to use cfg.seed or 42 for backward compat

2cfe9e9

Fix failing test

bfd27ba

Validate falcon with fsdp

babf0fd

Fix future deprecate prepare_model_for_int8_training

df9528f

Fix training over existing lora

193c73b
unverified

fix camel ai, add guanaco/oasst mapping for sharegpt

59bb219

new prompters, misc fixes for output dir missing using fsdp, and changing max seq len

4ac9e25

Update doc for grad_accu and add validation tests for batch size

3c71c8d

fix batch size calculation

5a631b3

fix packing so that concatenated sequences reset the attention

9b8585d

Merge pull request #124 from OpenAccess-AI-Collective/xformers-fix

2d0ba3b
unverified

Merge pull request #120 from OpenAccess-AI-Collective/model-from-path

c7021e1
unverified

don't worry about dupes

c56818b

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

1076bcb
unverified

Update src/axolotl/monkeypatch/llama_attn_hijack_xformers.py

2daa683
unverified

remove unused import and update readme

e3c494c

black formatting

ad0ea6a

copy xformers attn from ooba since we removed dep on alpaca_lora_4bit

6cb2310

add support for gradient accumulation steps

3aad5f3

fix up tokenizer config, isort fix

39a208c

split up llama model loading so config can be loaded from base config and models can be loaded from a path

2520ecd

Fix incorrect rebase

594e72b

Fix sharegpt prompt

25eeeeb

fix relative path for fixtures

cfcc549

Fix security issue or ignore false positives

a1f9850

Update src/axolotl/prompt_strategies/alpaca_instruct.py

c17dae6

Apply isort then black

37293dc

Fix mypy typing

e9650d3

Fix unsupported operand type(s) for |

be22551

Black formatting

b832a0a