# Mirage 12B Mirage represents a novel series of multi-modal models, positioned at a higher tier than the Bumblebee series. Our preceding 7B model (within the Bumblebee series) had attained a high score on numerous benchmarks. To further enhance its performance, we trained a 12B model. It comprises a ViT vision encoder and a Mistral nemo 12B model. The results indicate a substantial improvement when compared to many other open-source models. Our Mirage 12B has surpassed Cambrian-34B and LLava-Next-34B with a single 12B parameter. The benchmark scores can be viewed on the MMBench official leaderboard. ## Roadmap The currently released version is 2.0 of Mirage. Our 2.1 version is in progress and is likely to be even more potent with multi-lingual OCR support. If you have any inquiries, feel free to contact us through the issue!