RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published 5 days ago • 42
Releasing the largest multilingual open pretraining dataset Article • By Pclanglais • 10 days ago • 94
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published 17 days ago • 30
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model Paper • 2411.04496 • Published 17 days ago • 22 • 3
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 16 days ago • 109
Zebra-Llama: A Context-Aware Large Language Model for Democratizing Rare Disease Knowledge Paper • 2411.02657 • Published 19 days ago • 5
ATM: Improving Model Merging by Alternating Tuning and Merging Paper • 2411.03055 • Published 19 days ago • 1
BitStack: Fine-Grained Size Control for Compressed Large Language Models in Variable Memory Environments Paper • 2410.23918 • Published 24 days ago • 18 • 6