Tulu 3 Datasets Collection All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated 6 days ago • 43
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training Paper • 2411.15124 • Published 5 days ago • 50
Multimodal Autoregressive Pre-training of Large Vision Encoders Paper • 2411.14402 • Published 6 days ago • 36
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published 7 days ago • 36
Retrieve, Annotate, Evaluate, Repeat: Leveraging Multimodal LLMs for Large-Scale Product Retrieval Evaluation Paper • 2409.11860 • Published Sep 18 • 1
LLM2CLIP Collection LLM2CLIP makes SOTA pretrained CLIP modal more SOTA ever. • 7 items • Updated 8 days ago • 38
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 15 days ago • 59
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • 14 days ago • 95
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation Paper • 2411.04997 • Published 20 days ago • 35
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models Paper • 2411.04996 • Published 20 days ago • 48
Balancing Pipeline Parallelism with Vocabulary Parallelism Paper • 2411.05288 • Published 20 days ago • 19
LBPE: Long-token-first Tokenization to Improve Large Language Models Paper • 2411.05504 • Published 19 days ago • 1
Measuring short-form factuality in large language models Paper • 2411.04368 • Published 21 days ago • 1