Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Qwen
/
Qwen-1_8B
like
61
Follow
Qwen
5.88k
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
arxiv:
2309.16609
arxiv:
2305.08322
arxiv:
2009.03300
Model card
Files
Files and versions
Community
3
Train
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (1)
Intermediate_size is doubled in config.json
1
#3 opened 8 months ago by
hafezmg48
qwen-1_8B和qwen-7b辅助解码失败
#2 opened about 1 year ago by
rollinginthedeep