interesting - a smpanaro Collection

smpanaro 's Collections

Apple Neural Engine LLMs

quant

prune

interesting

updated Aug 2

LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Paper • 2401.01325 • Published Jan 2 • 26
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation

Paper • 2312.14187 • Published Dec 20, 2023 • 49
Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19 • 59
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies

Paper • 2404.06395 • Published Apr 9 • 21
Flash normalization: fast RMSNorm for LLMs

Paper • 2407.09577 • Published Jul 12 • 1
Pruning Large Language Models with Semi-Structural Adaptive Sparse Training

Paper • 2407.20584 • Published Jul 30