csnyder
's Collections
llm tryme
updated
google/flan-t5-large
Text2Text Generation
•
Updated
•
1.64M
•
•
662
deepseek-ai/deepseek-coder-6.7b-instruct
Text Generation
•
Updated
•
14k
•
361
Object Recognition as Next Token Prediction
Paper
•
2312.02142
•
Published
•
11
colbert-ir/dspy-Oct11-T5-Large-MH-3k-v1
Text2Text Generation
•
Updated
•
105
•
1
microsoft/phi-1_5
Text Generation
•
Updated
•
103k
•
1.32k
OLMo: Accelerating the Science of Language Models
Paper
•
2402.00838
•
Published
•
82
Quiet-STaR: Language Models Can Teach Themselves to Think Before
Speaking
Paper
•
2403.09629
•
Published
•
75
🐠
Idefics 8b
Yi: Open Foundation Models by 01.AI
Paper
•
2403.04652
•
Published
•
62
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and
Vision-Language Models Derived from Scientific Literature
Paper
•
2501.07171
•
Published
•
44