---
license: apple-ascl
pipeline_tag: image-text-to-text
---

This repository contains the Elva-OpenELM-450M model presented in [On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning](https://huggingface.co/papers/2406.11823).