Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning Paper • 2502.17407 • Published 4 days ago • 22
KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models Paper • 2412.06071 • Published Dec 8, 2024 • 9
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Paper • 2410.20672 • Published Oct 28, 2024 • 6
Running 164 164 Low-bit Quantized Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots
QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference Paper • 2402.10076 • Published Feb 15, 2024 • 2