Add model card

This PR adds a model card, linking to the paper [VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction](https://huggingface.co/papers/2501.01957).

It also adds the `pipeline_tag` metadata, so that people can find the model at https://huggingface.co/models?pipeline_tag=video-text-to-text, and a link to the GitHub repository.
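As a rough illustration of how the `pipeline_tag` in the README front matter relates to the filter URL above, here is a minimal Python sketch. The helper `read_front_matter` is hypothetical (it is not part of the Hub or of this PR) and only handles flat `key: value` pairs; a real model card may need a full YAML parser.

```python
# Hypothetical helper for illustration only; not a Hub API.
def read_front_matter(readme_text: str) -> dict[str, str]:
    """Return flat `key: value` pairs from a leading `---` delimited block."""
    lines = readme_text.splitlines()
    # Front matter must start on the very first line.
    if not lines or lines[0].strip() != "---":
        return {}
    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            # Closing delimiter reached: the block is complete.
            return metadata
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    # No closing delimiter: treat the text as having no front matter.
    return {}


# README content as added by this PR.
README = """---
pipeline_tag: video-text-to-text
---
This repository contains the model of the paper [VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction](https://huggingface.co/papers/2501.01957).
Code: https://github.com/VITA-MLLM/VITA
"""

tag = read_front_matter(README)["pipeline_tag"]
# The filter URL quoted in the PR description, rebuilt from the tag.
filter_url = f"https://huggingface.co/models?pipeline_tag={tag}"
print(filter_url)
```

The sketch only shows that the tag value in the front matter is the same string used in the `pipeline_tag` query parameter; how the Hub indexes the card internally is not covered here.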

Files changed (1)

README.md +6 -0

README.md ADDED

	@@ -0,0 +1,6 @@

+---
+pipeline_tag: video-text-to-text
+---
+This repository contains the model of the paper [VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction](https://huggingface.co/papers/2501.01957).
+Code: https://github.com/VITA-MLLM/VITA