MLap's Collections
Multimodality (updated 25 days ago)
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published Sep 27 • 91
Harnessing Webpage UIs for Text-Rich Visual Understanding
Paper • 2410.13824 • Published Oct 17 • 29