JianghuLu
wnma3mz
AI & ML interests
None yet
Recent Activity
liked
a model
21 days ago
genmo/mochi-1-preview
Organizations
None yet
wnma3mz's activity
How much data does 32k have?
#5 opened 4 months ago
by
wnma3mz
Why the intermediate_size of Qwen1.5-MoE-A2.7B is different from Qwen-1.8B?
3
#5 opened 5 months ago
by
ShiKeNLP
tokenizer chat_template has no role system
2
#9 opened 5 months ago
by
wnma3mz
How to transform the existing 1.8B into Qwen1.5-MoE-A2.7B?
2
#1 opened 8 months ago
by
wnma3mz
How to transform the existing 1.8B into Qwen1.5-MoE-A2.7B?
2
#1 opened 8 months ago
by
wnma3mz