qwerrwe / src

Commit History

experimental llama 2 chat support (#296)
3392270
unverified

Jan Philipp Harries Jan Philipp Harries commited on

Update XFormers Attention Monkeypatch to handle Llama-2 70B (GQA) (#339)
10405b9
unverified

ssmi153 commited on

Added Orca Mini prompt strategy (#263)
c93655c
unverified

Jan Philipp Harries Jan Philipp Harries commited on

optimize the iteration when tokenizeing large datasets (#332)
fe28543
unverified

winglian commited on

fix typo
2eda9e0

tmm1 commited on

scope flash-attn+qlora fix correctly, scope to llama, add comment
78b9efb

tmm1 commited on

move flash-attn monkey patch alongside the others
312a9fa

tmm1 commited on

ensure flash-attn fixes happen in both adapter/lora modes, and use torch_dtype
248bf90

tmm1 commited on

qlora w flash attention fixes (#333)
77085ea
unverified

winglian commited on

add peft install back since it doesn't get installed by setup.py (#331)
db2a358
unverified

winglian commited on

update prompts for open orca to match the paper (#317)
3d4984b
unverified

winglian commited on

Merge pull request #307 from OpenAccess-AI-Collective/xgen-user-sharegpt-tokens
40a53ff
unverified

winglian commited on

Merge pull request #313 from OpenAccess-AI-Collective/tokenizer-llama2-embeddings
3ffb018
unverified

winglian commited on

don't resize embeddings to multiples of 32x by default
1066751

winglian commited on

better handling since xgen tokenizer breaks with convert_tokens_to_ids
2a428e8

winglian commited on

flash attention 2
9b790d3

winglian commited on

fix sdp attention to use the flash/mem-efficient context manaager
a032c9f

winglian commited on

feat: use multi-core
45ac7c4

Nanobit commited on

fix axolotl training args dataclass annotation
ebaec3c

winglian commited on

misc fixes
d75adb9

winglian commited on

Merge pull request #276 from theobjectivedad/logging_enhancement
6f16c45
unverified

winglian commited on

Fixed pre-commit problems, fixed small bug in logging_config to handle LOG_LEVEL env var
b1f4f7a

theobjectivedad commited on

Merge branch 'OpenAccess-AI-Collective:main' into logging_enhancement
83237b8
unverified

The Objective Dad commited on

Add ability to pass 'name' argument to load_dataset
88089e8

chargoddard commited on

Merge pull request #274 from OpenAccess-AI-Collective/NanoCode012-patch-2
168a7a0
unverified

Nanobit commited on

Update log message format, IMO this is easier to read.
9234b75

theobjectivedad commited on

Feat: Add save_safetensors
5491278

Nanobit commited on

Set push to hub as private by default
1514739
unverified

Nanobit commited on

support for loading a model by git revision
69a2350

winglian commited on

Merge branch 'main' into quadratic-warmup
c4cf567
unverified

winglian commited on

better configuration for quadratic warmup
c49729d

winglian commited on

params are adam_*, not adamw_*
19cf0bd

winglian commited on

skip explicit model type too if using trust_remote_code
d69da99

winglian commited on

don't use llama if trust_remote_code is set since that needs to use AutoModel path
66afb76

winglian commited on

Merge pull request #221 from utensil/local_dataset
b9b7d4c
unverified

winglian commited on

Fix future deprecation push_to_hub_model_id
e79c8e6

Nanobit commited on

Merge pull request #255 from OpenAccess-AI-Collective/open-orca-prompts
1e5014a
unverified

winglian commited on

Merge pull request #246 from OpenAccess-AI-Collective/sys-prompts-instruct
4066c78
unverified

winglian commited on

open orca support
78a1e1f

winglian commited on

Fix typing list
77bdb7d
unverified

Nanobit commited on

add option for instruct w sys prompts
924bbfd

winglian commited on

Merge pull request #224 from OpenAccess-AI-Collective/system-prompt-data
f150c02
unverified

winglian commited on

push intermediate model checkpoints to hub
612aabd

winglian commited on

skip the system prompt
05ab909

winglian commited on

pylint for duplicated code for system prompts
7b57ed7

winglian commited on

add tests and supoort for loader for sys prompt data
3a38271

winglian commited on

initial wip to get sys prompt from dataset
8d20e0a

winglian commited on

optionally define whether to use_fast tokenizer
47d601f

winglian commited on

Support loading data files from a local directory
9bdd30c

utensil commited on