L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S4096-step1-rand2b-nolearn-o-token1B
This is a model uploaded from /mnt/nanjingcephfs/project_wx-rec-alg-bdc-exp/bwzheng/yulan/hyw/pretrain-linear-moe-dev/RADLADS-paper/out/L56-D1920-qwen_gdn_qwen2-e1-nh6-hd64-nvh30-A0-S4096--step1-rand2b-nolearn-o-token1B.