Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
8da1633
qwerrwe
/
src
/
axolotl
/
monkeypatch
100 contributors
History:
58 commits
DreamGenX
Respect sliding_window=None (#1214)
62ca4a2
unverified
10 months ago
falcon
Falcon embeddings (#1149) [skip docker]
11 months ago
mixtral
Mixtral fixes 20240124 (#1192) [skip ci]
10 months ago
phi
Phi2 multipack (#1173)
11 months ago
qwen2
Qwen2 (#1166)
11 months ago
btlm_attn_hijack_flash.py
Safe
2.32 kB
flash_attention + sample packing for stablelm 3b (#671)
about 1 year ago
fastchat_conversation_turns.py
Safe
8.33 kB
Added chatglm3 conversation type for training models like TinyLLama (#1036)
11 months ago
llama_attn_hijack_flash.py
Safe
32 kB
Add shifted sparse attention (#973) [skip-ci]
11 months ago
llama_attn_hijack_sdp.py
Safe
4.81 kB
various bugfixes (#856)
about 1 year ago
llama_attn_hijack_xformers.py
Safe
5.69 kB
various bugfixes (#856)
about 1 year ago
llama_expand_mask.py
Safe
1.92 kB
Attention mask and position id fixes for packing (#285)
over 1 year ago
mistral_attn_hijack_flash.py
Safe
22.5 kB
Respect sliding_window=None (#1214)
10 months ago
relora.py
Safe
14 kB
fix checkpints on multigpu (#481)
over 1 year ago
stablelm_attn_hijack_flash.py
Safe
15.4 kB
flash_attention + sample packing for stablelm 3b (#671)
about 1 year ago
utils.py
Safe
5.3 kB
Multipack simplify for Mixtral (#1142)
11 months ago