Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
411293b
qwerrwe
/
src
/
axolotl
/
monkeypatch
100 contributors
History:
60 commits
winglian
relora: magnitude pruning of the optimizer (#1245)
8c2e05a
unverified
about 1 year ago
data
support for true batches with multipack (#1230)
about 1 year ago
falcon
Falcon embeddings (#1149) [skip docker]
about 1 year ago
mixtral
Mixtral fixes 20240124 (#1192) [skip ci]
about 1 year ago
phi
Phi2 multipack (#1173)
about 1 year ago
qwen2
Qwen2 (#1166)
about 1 year ago
btlm_attn_hijack_flash.py
Safe
2.32 kB
flash_attention + sample packing for stablelm 3b (#671)
over 1 year ago
fastchat_conversation_turns.py
Safe
8.33 kB
Added chatglm3 conversation type for training models like TinyLLama (#1036)
about 1 year ago
llama_attn_hijack_flash.py
Safe
32 kB
Add shifted sparse attention (#973) [skip-ci]
about 1 year ago
llama_attn_hijack_xformers.py
Safe
5.69 kB
various bugfixes (#856)
about 1 year ago
llama_expand_mask.py
Safe
672 Bytes
support for true batches with multipack (#1230)
about 1 year ago
llama_patch_multipack.py
Safe
1.1 kB
support for true batches with multipack (#1230)
about 1 year ago
mistral_attn_hijack_flash.py
Safe
22.5 kB
Respect sliding_window=None (#1214)
about 1 year ago
relora.py
16.7 kB
relora: magnitude pruning of the optimizer (#1245)
about 1 year ago
stablelm_attn_hijack_flash.py
Safe
15.4 kB
flash_attention + sample packing for stablelm 3b (#671)
over 1 year ago
utils.py
8.01 kB
support for true batches with multipack (#1230)
about 1 year ago