Qwen-14B-Chat-Int4 / modeling_qwen.py

Commit History

remove fix-sized causal mask

6e72378

yangapku commited on Nov 14, 2023

add kernel file check in modeling_qwen.py

0374c21

yangapku commited on Nov 5, 2023

update modeling.py

f5e4b21

yangapku commited on Oct 26, 2023

update modeling_qwen.py

5d28542

yangapku commited on Oct 16, 2023

update batch inference

4b4dcdc

yangapku commited on Oct 14, 2023

softmax_in_fp32

6c6ec1c

yangapku commited on Sep 28, 2023

update modeling_qwen.py

5d52159

yangapku commited on Sep 27, 2023

update kernels

b980709

yangapku commited on Sep 27, 2023

update modeling_qwen.py

45eb93c

yangapku commited on Sep 26, 2023

update modeling_qwen.py

0f5e18f

yangapku commited on Sep 25, 2023

update kvcache

a828abf

yangapku commited on Sep 25, 2023

update readme

f47dcd2

yangapku commited on Sep 24, 2023

update batch infer

d83208a

yangapku commited on Sep 24, 2023

upload model

ac4ce9b

yangapku commited on Sep 24, 2023

Commit History

remove fix-sized causal mask 6e72378

add kernel file check in modeling_qwen.py 0374c21

update modeling.py f5e4b21

update modeling_qwen.py 5d28542

update batch inference 4b4dcdc

softmax_in_fp32 6c6ec1c

update modeling_qwen.py 5d52159

update kernels b980709

update modeling_qwen.py 45eb93c

update modeling_qwen.py 0f5e18f

update kvcache a828abf

update readme f47dcd2

update batch infer d83208a

upload model ac4ce9b

remove fix-sized causal mask

6e72378

add kernel file check in modeling_qwen.py

0374c21

update modeling.py

f5e4b21

update modeling_qwen.py

5d28542

update batch inference

4b4dcdc

softmax_in_fp32

6c6ec1c

update modeling_qwen.py

5d52159

update kernels

b980709

update modeling_qwen.py

45eb93c

update modeling_qwen.py

0f5e18f

update kvcache

a828abf

update readme

f47dcd2

update batch infer

d83208a

upload model

ac4ce9b