Qwen/Qwen-14B-Chat-Int4
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
4-bit precision
gptq
Qwen-14B-Chat-Int4/modeling_qwen.py
Commit History
remove fix-sized causal mask
6e72378
yangapku committed on Nov 14, 2023
add kernel file check in modeling_qwen.py
0374c21
yangapku committed on Nov 5, 2023
update modeling.py
f5e4b21
yangapku committed on Oct 26, 2023
update modeling_qwen.py
5d28542
yangapku committed on Oct 16, 2023
update batch inference
4b4dcdc
yangapku committed on Oct 14, 2023
softmax_in_fp32
6c6ec1c
yangapku committed on Sep 28, 2023
update modeling_qwen.py
5d52159
yangapku committed on Sep 27, 2023
update kernels
b980709
yangapku committed on Sep 27, 2023
update modeling_qwen.py
45eb93c
yangapku committed on Sep 26, 2023
update modeling_qwen.py
0f5e18f
yangapku committed on Sep 25, 2023
update kvcache
a828abf
yangapku committed on Sep 25, 2023
update readme
f47dcd2
yangapku committed on Sep 24, 2023
update batch infer
d83208a
yangapku committed on Sep 24, 2023
upload model
ac4ce9b
yangapku committed on Sep 24, 2023