view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 91
view article Article Controlling Language Model Generation with NVIDIA's LogitsProcessorZoo Dec 23, 2024 • 38
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining Paper • 2501.00958 • Published 28 days ago • 99
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 128
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated Dec 18, 2024 • 18
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published Dec 15, 2024 • 27
Synthetic Data Generator Collection A collection of tools and datasets related to no-code the Synthetic Data Generation. • 19 items • Updated 10 days ago • 7
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 46
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated Dec 13, 2024 • 132
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 134
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation Paper • 2411.04997 • Published Nov 7, 2024 • 37