jayzou3773/qwen3-moe-expert_drop-pure_gradient_pruning-r64-s1k-128samples-thinking 16B • Updated 27 days ago • 118
jayzou3773/qwen3-moe-expert_drop-pure_expert_gradient_pruning-r64-s1k-128samples-thinking 16B • Updated 27 days ago • 146
jayzou3773/qwen3-moe-expert_drop-layerwise_pruning-r64-s1k-128samples-thinking 16B • Updated 27 days ago • 164
jayzou3773/qwen3-moe-expert_drop-bias_pruning-r64-s1k-128samples-thinking 16B • Updated 27 days ago • 160
jayzou3773/qwen3-moe-neuron_structure_drop-p50-s1k-128samples-thinking 16B • Updated 27 days ago • 292
jayzou3773/qwen3_5-moe-neuron_structure_drop-p50-s1k-128samples 19B • Updated about 1 month ago • 187
jayzou3773/qwen3_5-moe-expert_drop-weight_magnitude_pruning-r128-s1k-128samples 19B • Updated about 1 month ago • 129
jayzou3773/qwen3_5-moe-expert_drop-pure_gradient_pruning-r128-s1k-128samples 19B • Updated about 1 month ago • 61
jayzou3773/qwen3_5-moe-expert_drop-pure_expert_gradient_pruning-r128-s1k-128samples 19B • Updated about 1 month ago • 114
jayzou3773/qwen3_5-moe-expert_drop-layerwise_pruning-r128-s1k-128samples 19B • Updated about 1 month ago • 95
jayzou3773/qwen3_5-moe-expert_drop-bias_pruning-r128-s1k-128samples 19B • Updated about 1 month ago • 95
jayzou3773/qwen3-moe-expert_drop-weight_magnitude_pruning-r64-s1k-128samples 16B • Updated about 1 month ago • 36
jayzou3773/qwen3-moe-expert_drop-pure_gradient_pruning-r64-s1k-128samples 16B • Updated about 1 month ago • 26
jayzou3773/qwen3-moe-expert_drop-pure_expert_gradient_pruning-r64-s1k-128samples 16B • Updated about 1 month ago • 39
jayzou3773/qwen3-moe-expert_drop-layerwise_pruning-r64-s1k-128samples 16B • Updated about 1 month ago • 37
jayzou3773/qwen3-moe-expert_drop-bias_pruning-r64-s1k-128samples 16B • Updated about 1 month ago • 36