Qwen/Qwen-7B
Tags: Text Generation, Transformers, Safetensors, Chinese, English, qwen, custom_code
arXiv: 2309.16609
License: tongyi-qianwen-license-agreement
Files and versions
Revision: f0cf652
2 contributors
History: 12 commits
Latest commit: yangapku, "fix flash-attention usage" (f0cf652), over 1 year ago
Name                      Scan  Size       Notes        Last commit                  Updated
assets                    -     -          -            upload resource files        over 1 year ago
.gitattributes            Safe  1.52 kB    -            initial commit               over 1 year ago
LICENSE                   Safe  6.9 kB     -            upload resource files        over 1 year ago
NOTICE                    Safe  1.56 kB    -            upload resource files        over 1 year ago
README.md                 Safe  20.9 kB    -            fix flash-attention usage    over 1 year ago
config.json               Safe  1.11 kB    -            fix flash-attention usage    over 1 year ago
configuration_qwen.py     Safe  2.33 kB    -            upload resource files        over 1 year ago
generation_config.json    Safe  361 Bytes  -            upload resource files        over 1 year ago
modeling_qwen.py          Safe  37.3 kB    -            fix flash-attention usage    over 1 year ago
pytorch_model.bin         Safe  15.4 GB    LFS, pickle  upload resource files        over 1 year ago
qwen.tiktoken             Safe  2.56 MB    -            upload resource files        over 1 year ago
qwen_generation_utils.py  Safe  14.4 kB    -            upload resource files        over 1 year ago
tokenization_qwen.py      Safe  8.08 kB    -            update tokenization_qwen.py  over 1 year ago
tokenizer_config.json     Safe  196 Bytes  -            upload resource files        over 1 year ago

Detected pickle imports (3) in pytorch_model.bin:
"collections.OrderedDict", "torch.BFloat16Storage", "torch._utils._rebuild_tensor_v2"
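
The listing reports three detected pickle imports for pytorch_model.bin. As a rough illustration of how such a list can be derived without executing the pickle, the sketch below walks the opcode stream with the standard-library `pickletools` module and collects the globals it references. This is not the scanner Hugging Face uses. The function name `list_pickle_imports` is hypothetical, the handling of `STACK_GLOBAL` is a simple heuristic, and the input is a tiny in-memory pickle rather than the 15.4 GB checkpoint.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list[str]:
    """Return the sorted 'module.name' globals a pickle stream references.

    The stream is only disassembled, never unpickled, so no code in it runs.
    """
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name"
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Heuristic: module and name are the two most recent strings pushed
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return sorted(imports)


# Hypothetical stand-in for a checkpoint: a small OrderedDict pickled in memory
payload = pickle.dumps(collections.OrderedDict(a=1), protocol=2)
print(list_pickle_imports(payload))
```

An import list restricted to container and tensor-rebuilding helpers, like the three shown for pytorch_model.bin, is what one would expect from a plain weights file. An unexpected entry such as `os.system` would be a reason not to load the file.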