Model Card for VideoChat2

This model card provides information on the VideoChat2 model from 'MVBench: A Comprehensive Multi-modal Video Understanding Benchmark'.

Model Details

Model Sources

Repository: VideoChat2
Paper: 2311.17005
Point of Contact: InternVideo Group

Citation

If you find this work useful for your research, please consider citing VideoChat2. Your acknowledgement would greatly help us in continuing to contribute resources to the research community.

@article{li2023videochat,
  title={VideoChat: Chat-Centric Video Understanding},
  author={KunChang Li and Yinan He and Yi Wang and Yizhuo Li and Wenhai Wang and Ping Luo and Yali Wang and Limin Wang and Yu Qiao},
  journal={arXiv preprint arXiv:2305.06355},
  year={2023}
}

@misc{li2023mvbench,
  title={MVBench: A Comprehensive Multi-modal Video Understanding Benchmark},
  author={Kunchang Li and Yali Wang and Yinan He and Yizhuo Li and Yi Wang and Yi Liu and Zun Wang and Jilan Xu and Guo Chen and Ping Luo and Limin Wang and Yu Qiao},
  year={2023},
  eprint={2311.17005},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

Papers for Andy1621/VideoChat2_VicunaV0_7B_stage3_noLoRA

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark (arXiv 2311.17005, published Nov 28, 2023)

VideoChat: Chat-Centric Video Understanding (arXiv 2305.06355, published May 10, 2023)