Fractal Patterns May Unravel the Intelligence in Next-Token Prediction Paper • 2402.01825 • Published Feb 2 • 2
Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution Paper • 2307.06304 • Published Jul 12, 2023 • 27
PaLI-X: On Scaling up a Multilingual Vision and Language Model Paper • 2305.18565 • Published May 29, 2023 • 3