Fantastic Pretraining Optimizers and Where to Find Them Paper • 2509.02046 • Published 2 days ago • 8
Benchmarking Optimizers for Large Language Model Pretraining Paper • 2509.01440 • Published 3 days ago • 17
POINTS-Reader: Distillation-Free Adaptation of Vision-Language Models for Document Conversion Paper • 2509.01215 • Published 3 days ago • 42
Hybrid Linear Attention Research Collection All 1.3B & 340M hybrid linear-attention experiments. • 60 items • Updated Jul 7 • 12
Post: Run OpenAI's new gpt-oss models locally with Unsloth GGUFs! 20b GGUF: unsloth/gpt-oss-20b-GGUF. 120b GGUF: unsloth/gpt-oss-120b-GGUF. The models run on 14GB RAM for 20b and 66GB for 120b.
FastVLM: Efficient Vision Encoding for Vision Language Models Paper • 2412.13303 • Published Dec 17, 2024 • 52