Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
fccb542
qwerrwe
/
src
/
axolotl
/
monkeypatch
100 contributors
History:
53 commits
winglian
Multipack simplify for Mixtral (#1142)
6910e6a
unverified
11 months ago
mixtral
Multipack simplify for Mixtral (#1142)
11 months ago
btlm_attn_hijack_flash.py
Safe
2.32 kB
flash_attention + sample packing for stablelm 3b (#671)
about 1 year ago
fastchat_conversation_turns.py
Safe
8.33 kB
Added chatglm3 conversation type for training models like TinyLLama (#1036)
11 months ago
llama_attn_hijack_flash.py
Safe
32 kB
Add shifted sparse attention (#973) [skip-ci]
11 months ago
llama_attn_hijack_sdp.py
Safe
4.81 kB
various bugfixes (#856)
about 1 year ago
llama_attn_hijack_xformers.py
Safe
5.69 kB
various bugfixes (#856)
about 1 year ago
llama_expand_mask.py
Safe
1.92 kB
Attention mask and position id fixes for packing (#285)
over 1 year ago
mistral_attn_hijack_flash.py
Safe
22.4 kB
adds llama and mistral dropout support (#858)
about 1 year ago
relora.py
Safe
14 kB
fix checkpints on multigpu (#481)
over 1 year ago
stablelm_attn_hijack_flash.py
Safe
15.4 kB
flash_attention + sample packing for stablelm 3b (#671)
about 1 year ago
utils.py
Safe
5.3 kB
Multipack simplify for Mixtral (#1142)
11 months ago