qwerrwe / tests /monkeypatch /test_llama_attn_hijack_flash.py

Commit History

support for true batches with multipack (#1230)
00568c1
unverified

winglian commited on

Multipack simplify for Mixtral (#1142)
6910e6a
unverified

winglian commited on

Attention mask and position id fixes for packing (#285)
2bb0b78
unverified

winglian commited on