llm-jp/optimal-sparsity-math-d2048-E128-k16-52.2B-A7.1B Text Generation • 52B • Updated about 18 hours ago • 20
llm-jp/optimal-sparsity-math-d2048-E64-k16-26.4B-A7.1B Text Generation • 26B • Updated about 18 hours ago • 15
llm-jp/optimal-sparsity-math-d2048-E32-k16-13.6B-A7.1B Text Generation • 14B • Updated about 18 hours ago • 18
llm-jp/optimal-sparsity-math-d2048-E16-k16-7.1B-A7.1B Text Generation • 7B • Updated about 18 hours ago • 21
llm-jp/optimal-sparsity-math-d1024-E256-k16-26.0B-A1.9B Text Generation • 26B • Updated about 18 hours ago • 17
llm-jp/optimal-sparsity-math-d1024-E128-k16-13.2B-A1.9B Text Generation • 13B • Updated about 18 hours ago • 21
llm-jp/optimal-sparsity-math-d1024-E64-k16-6.7B-A1.9B Text Generation • 7B • Updated about 18 hours ago • 24
llm-jp/optimal-sparsity-math-d1024-E32-k16-3.5B-A1.9B Text Generation • 3B • Updated about 18 hours ago • 18
llm-jp/optimal-sparsity-math-d1024-E16-k16-1.9B-A1.9B Text Generation • 2B • Updated about 18 hours ago • 19
llm-jp/optimal-sparsity-math-d512-E256-k16-6.6B-A520M Text Generation • 7B • Updated about 18 hours ago • 21
llm-jp/optimal-sparsity-math-d512-E128-k16-3.3B-A520M Text Generation • 3B • Updated about 18 hours ago • 19
llm-jp/optimal-sparsity-math-d512-E64-k16-1.7B-A520M Text Generation • 2B • Updated about 18 hours ago • 19
llm-jp/optimal-sparsity-math-d512-E32-k16-920M-A520M Text Generation • 0.9B • Updated about 18 hours ago • 19
llm-jp/optimal-sparsity-math-d512-E16-k16-520M-A520M Text Generation • 0.5B • Updated about 18 hours ago • 17
llm-jp/optimal-sparsity-math-d2048-E128-k8-52.2B-A3.9B Text Generation • 52B • Updated about 18 hours ago • 17
llm-jp/optimal-sparsity-math-d2048-E64-k8-26.4B-A3.9B Text Generation • 26B • Updated about 18 hours ago • 17
llm-jp/optimal-sparsity-math-d2048-E32-k8-13.6B-A3.9B Text Generation • 14B • Updated about 18 hours ago • 18
llm-jp/optimal-sparsity-math-d2048-E16-k8-7.1B-A3.9B Text Generation • 7B • Updated about 18 hours ago • 18
llm-jp/optimal-sparsity-math-d2048-E8-k8-3.9B-A3.9B Text Generation • 4B • Updated about 18 hours ago • 23
llm-jp/optimal-sparsity-math-d1024-E256-k8-26.0B-A1.1B Text Generation • 26B • Updated about 18 hours ago • 17