Evaluating Language Models as Synthetic Data Generators • Paper • 2412.03679 • Published 13 days ago • 43
Finance Commons • Collection • A large collection of multimodal financial documents in open data. • 7 items • Updated Jul 17 • 7
Llama 3.2 • Collection • This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 12 days ago • 543
MedEmbed: Embedding Models for Medical Domain • Collection • GitHub -> https://github.com/abhinand5/MedEmbed • 4 items • Updated Oct 21 • 9
Addition is All You Need for Energy-efficient Language Models • Paper • 2410.00907 • Published Oct 1 • 144
Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts • Paper • 2409.16040 • Published Sep 24 • 12
DataGemma Release • Collection • A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 4 days ago • 81