llm-jp's Collections

Optimal Sparsity Code

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks