Llama-2-7b-AQLM-2Bit-1x16-hf / modeling_llama_aqlm.py

Commit History

try except flash-attn
f48478c

Andrei Panferov commited on

inference lib
03ea233

Andrei Panferov commited on

new code
dfb8eb3

Andrei Panferov commited on