An Yang
yangapku
AI & ML interests
NLP and Deep Learning
Recent Activity
updated
a model
13 days ago
Qwen/Qwen2.5-Coder-7B-Instruct
updated
a model
13 days ago
Qwen/Qwen2.5-Coder-1.5B-Instruct
updated
a model
about 2 months ago
Qwen/Qwen2-VL-72B-Instruct-AWQ
Organizations
yangapku's activity
Create README.md
#1 opened 2 months ago
by
Zhenru
Update README of branch dev_triton.
2
#11 opened 11 months ago
by
Cheshire94
Does Qwen support 16k context, what is the best config for max_new_tokens?
2
#22 opened over 1 year ago
by
Cheshire94
RuntimeError: The size of tensor a (8192) must match the size of tensor b (11581) at non-singleton dimension 3
1
#32 opened over 1 year ago
by
wujiekd
Fix typo
#29 opened over 1 year ago
by
IlysvlVEizbr
Load tokenizer and model in no internet kernel?
1
#33 opened over 1 year ago
by
nikjohn7
FlashAttention still needs to be disabled during inference; with it enabled, the output is currently garbled
1
#27 opened over 1 year ago
by
Trangle
I see the model has been updated; is there any explanation of the changes?
2
#21 opened over 1 year ago
by
Weiguo
The _convert_id_to_token method is not implemented
2
#1 opened over 1 year ago
by
YeungNLP
does it support Chinese and English mixed input?
5
#1 opened almost 2 years ago
by
Baicai003
How can I add context with text input along with the image and the labels?
3
#5 opened almost 2 years ago
by
micole66
remove styling to fix spacing
#4 opened almost 2 years ago
by
akhaliq
Minor nit
1
#3 opened almost 2 years ago
by
osanseviero