metadata
license: apache-2.0
language:
- en
metrics:
- accuracy
base_model:
- Qwen/Qwen2-VL-7B-Instruct
pipeline_tag: visual-question-answering
MMEvol Model Card
Model Details
The pretrained projector weights and the instruction-tuned checkpoint are listed below.
Model | Pretrained Projector | Base LLM | Pretraining (PT) Data | Instruction-Tuning (IT) Data | Download |
---|---|---|---|---|---|
MMEvol-Qwen2-7B | mm_projector | Qwen2-7B | LLaVA-Pretrain | MMEvol | ckpt |
Performance
Benchmarks Supported by VLMEvalKit (OpenCompass)
Model | MME_C | MMStar | HallBench | MathVista_mini | MMMU_val | AI2D | POPE | BLINK | RWQA |
---|---|---|---|---|---|---|---|---|---|
MMEvol-Qwen2-7B | 55.8 | 51.6 | 64.1 | 52.4 | 45.1 | 74.7 | 87.8 | 47.7 | 63.9 |
Benchmarks Not Supported by VLMEvalKit (VQADataSet)
Model | VQA_v2 | GQA | MIA | MMSInst |
---|---|---|---|---|
MMEvol-Qwen2-7B | 83.1 | 65.5 | 77.6 | 41.8 |
Paper or resources for more information
License
Llama 3 is licensed under the LLAMA 3 Community License, Copyright (c) Meta Platforms, Inc. All Rights Reserved.
Contact us if you have any questions
- Run Luo — r.luo@siat.ac.cn
- Haonan Zhang — zchiowal@gmail.com