view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 • 165
ChatQA: Building GPT-4 Level Conversational QA Models Paper • 2401.10225 • Published Jan 18 • 33
What You See is What You GAN: Rendering Every Pixel for High-Fidelity Geometry in 3D GANs Paper • 2401.02411 • Published Jan 4 • 12
MobileSAMv2: Faster Segment Anything to Everything Paper • 2312.09579 • Published Dec 15, 2023 • 20
Lost in the Middle: How Language Models Use Long Contexts Paper • 2307.03172 • Published Jul 6, 2023 • 36
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding Paper • 2306.17107 • Published Jun 29, 2023 • 11