Michael4933 committed · verified
Commit 6b82c25 · Parent(s): 8319ea2

Update README.md

Files changed (1): README.md (+10 -10)
README.md CHANGED
@@ -12,7 +12,7 @@ base_model:
 - Qwen/Qwen2-VL-7B-Instruct
 ---
 Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
-<!--
+
 <p align="center">
     <img src="https://cdn-uploads.huggingface.co/production/uploads/654f3e104c8874c64d43aafa/RrciC01LCU7QUqh9kEAp-.png" style="width: 30%; max-width: 600px;">
 </p>
@@ -22,17 +22,17 @@ Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal L
 
 -----
 
-<a href='https://michael4933.github.io/'><img src='https://img.shields.io/badge/Project-Page-Green'></a> <a href='#'><img src='https://img.shields.io/badge/Demo-Page-purple'></a> <a href='https://arxiv.org/abs/2411.03628'><img src='https://img.shields.io/badge/Paper-PDF-orange'></a> <a href='https://huggingface.co/Michael4933/Migician'><img src='https://img.shields.io/badge/Model-Huggingface-red'></a> <a href='https://huggingface.co/datasets/Michael4933/MIG-Bench'><img src='https://img.shields.io/badge/Benchmark-Huggingface-yellow'></a> <a href='https://huggingface.co/datasets/Michael4933/MGrounding-630k'><img src='https://img.shields.io/badge/Dataset-Huggingface-blue'></a>
+<a href='https://michael4933.github.io/'><img src='https://img.shields.io/badge/Project-Page-Green'></a> <a href='#'><img src='https://img.shields.io/badge/Demo-Page-purple'></a> <a href='https://arxiv.org/abs/2501.05767'><img src='https://img.shields.io/badge/Paper-PDF-orange'></a> <a href='https://huggingface.co/Michael4933/Migician'><img src='https://img.shields.io/badge/Model-Huggingface-red'></a> <a href='https://huggingface.co/datasets/Michael4933/MIG-Bench'><img src='https://img.shields.io/badge/Benchmark-Huggingface-yellow'></a> <a href='https://huggingface.co/datasets/Michael4933/MGrounding-630k'><img src='https://img.shields.io/badge/Dataset-Huggingface-blue'></a>
 
 This repository hosts the usage details of our training dataset <strong>MGrounding-630k</strong>, our benchmark <strong>MIG-Bench</strong>, and the training implementation of Migician, the first competitive Multi-Image Grounding MLLM capable of free-form grounding.
 
 -----------
 
 ## 📰 News
-* **[2025.02.16]** 🥳🥳🥳 Our [Paper](https://arxiv.org/abs/2411.03628) has been accepted by ACL 2025 as an Oral Paper!
+* **[2025.02.16]** 🥳🥳🥳 Our [Paper](https://arxiv.org/abs/2501.05767) has been accepted by ACL 2025 as an Oral Paper!
 * **[2025.01.09]** 🌷🌷🌷 We have further released our multi-image grounding training dataset [MGrounding_630k](https://huggingface.co/datasets/Michael4933/MGrounding-630k) and our comprehensive multi-image grounding benchmark [MIG-Bench](https://huggingface.co/datasets/Michael4933/MIG-Bench) on Huggingface 🤗~ Feel free to download them for your own use.
 * **[2025.01.05]** 🌟🌟🌟 The model weights are now available on HuggingFace! 🤗 Download and have a try at [Huggingface Model](https://huggingface.co/Michael4933/Migician)!
-* **[2025.01.02]** 🌞🌞🌞 We have released our paper on [arXiv](https://arxiv.org/abs/2411.03628) at the start of the new year!
+* **[2025.01.02]** 🌞🌞🌞 We have released our paper on [arXiv](https://arxiv.org/abs/2501.05767) at the start of the new year!
 
 ## 📝 Abstract
 
@@ -295,10 +295,10 @@ Migician/
 
 ## 📝 Citation
 ```bibtex
-@article{lin2024streaming,
-  title={StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding},
-  author={Junming Lin and Zheng Fang and Chi Chen and Zihao Wan and Fuwen Luo and Peng Li and Yang Liu and Maosong Sun},
-  journal={arXiv preprint arXiv:2411.03628},
-  year={2024}
+@misc{li2025migicianrevealingmagicfreeform,
+  title={Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models},
+  author={You Li and Heyu Huang and Chi Chen and Kaiyu Huang and Chao Huang and Zonghao Guo and Zhiyuan Liu and Jinan Xu and Yuhua Li and Ruixuan Li and Maosong Sun},
+  year={2025},
+  url={https://arxiv.org/abs/2501.05767},
 }
-``` -->
+```
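
The News section in the updated README tells readers to "download and have a try" at the released checkpoint. Migician is declared as a fine-tune of Qwen/Qwen2-VL-7B-Instruct (the card's `base_model` field), so a minimal inference sketch follows, assuming the checkpoint keeps the standard Qwen2-VL chat interface from `transformers` and the `qwen-vl-utils` helper used in the Qwen2-VL examples; the image paths and the grounding prompt are illustrative placeholders, not part of this commit.

```python
# Minimal multi-image grounding sketch, assuming Migician exposes the standard
# Qwen2-VL inference interface of its base model (Qwen/Qwen2-VL-7B-Instruct).
# The image paths and prompt below are hypothetical placeholders.
import torch
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor
from qwen_vl_utils import process_vision_info  # pip install qwen-vl-utils

model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Michael4933/Migician",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained("Michael4933/Migician")

# One free-form query that refers across two images.
messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "image": "./examples/query.jpg"},  # placeholder
            {"type": "image", "image": "./examples/scene.jpg"},  # placeholder
            {"type": "text", "text": "Locate the object shown in the first image within the second image."},
        ],
    }
]

# Build model inputs with the Qwen2-VL chat template and vision preprocessing.
text = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
image_inputs, video_inputs = process_vision_info(messages)
inputs = processor(
    text=[text],
    images=image_inputs,
    videos=video_inputs,
    padding=True,
    return_tensors="pt",
).to(model.device)

# Generate and strip the prompt tokens from the decoded answer.
output_ids = model.generate(**inputs, max_new_tokens=128)
trimmed = [out[len(inp):] for inp, out in zip(inputs.input_ids, output_ids)]
print(processor.batch_decode(trimmed, skip_special_tokens=True)[0])
```

Any grounded coordinates come back as plain text in the decoded answer; the exact output format is defined by the model card's full usage section, not by this sketch.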
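The same section links the MGrounding-630k training set and the MIG-Bench benchmark on the Hub. A hedged sketch of fetching both with `huggingface_hub` follows; the repo IDs are taken from the badge links above, while the local directories are illustrative.

```python
# Download the dataset and benchmark repos referenced in the News section.
# Repo IDs come from the README's badge links; local_dir values are assumptions.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="Michael4933/MGrounding-630k",
    repo_type="dataset",
    local_dir="./data/MGrounding-630k",  # illustrative path
)
snapshot_download(
    repo_id="Michael4933/MIG-Bench",
    repo_type="dataset",
    local_dir="./data/MIG-Bench",  # illustrative path
)
```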