Vision Transformer Adapters for Generalizable Multitask Learning Paper • 2308.12372 • Published Aug 23, 2023
DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion Paper • 2309.12424 • Published Sep 21, 2023
PaLI-3 Vision Language Models: Smaller, Faster, Stronger Paper • 2310.09199 • Published Oct 13, 2023
Approximating Two-Layer Feedforward Networks for Efficient Transformers Paper • 2310.10837 • Published Oct 16, 2023