L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S4096-step1-rand2b-token2B

This is a model uploaded from /mnt/yulan_pretrain/gaoyanzipeng/models/distill/L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S4096-step1-rand2b-token2B.

Downloads last month
2
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support