- Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
  Paper • 2309.10150 • Published • 24
- In-Context Pretraining: Language Modeling Beyond Document Boundaries
  Paper • 2310.10638 • Published • 29
- Farzi Data: Autoregressive Data Distillation
  Paper • 2310.09983 • Published • 9
- LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
  Paper • 2311.05437 • Published • 49
Mat Miller
matdmiller
AI & ML interests
None yet
Recent Activity
upvoted an article 12 days ago
Finally, a Replacement for BERT: Introducing ModernBERT
liked a Space 5 months ago
gpt-omni/mini-omni
liked a model 6 months ago
vikp/surya_det3
Organizations
Collections: 1
Spaces: 4
Datasets: None public yet