BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Paper • 2501.07171 • Published 5 days ago • 45
ChatQA 2: Bridging the Gap to Proprietary LLMs in Long Context and RAG Capabilities Paper • 2407.14482 • Published Jul 19, 2024 • 26
Running on CPU Upgrade 66 🏆 Open Ita Llm Leaderboard Track, rank and evaluate open LLMs in the italian language!
view post Post 5783 Working on a concept GPT-2 (small) that uses KANs instead of MLPs.The ckpt and training code will be soon on the hub. 6 replies · 🚀 31 31 👍 13 13 🔥 11 11 🤯 4 4 ➕ 4 4 + Reply
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 23 items • Updated Dec 18, 2024 • 181
Rethinking Interpretability in the Era of Large Language Models Paper • 2402.01761 • Published Jan 30, 2024 • 23
RAG vs Fine-tuning: Pipelines, Tradeoffs, and a Case Study on Agriculture Paper • 2401.08406 • Published Jan 16, 2024 • 37