MBZUAI
/

Video-ChatGPT-7B

Visual Question Answering

Inference Endpoints

Model card Files Files and versions Community

Video-ChatGPT-7B / README.md

mmaaz60's picture

Update README.md

1b89a97 over 1 year ago

|

history blame contribute delete

481 Bytes

	---
	license: cc-by-4.0
	datasets:
	- MBZUAI/Video-Instruct-Dataset
	language:
	- en
	library_name: transformers
	pipeline_tag: visual-question-answering
	---

	Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos.
	It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation.

	GitHub: [https://github.com/mbzuai-oryx/Video-ChatGPT](https://github.com/mbzuai-oryx/Video-ChatGPT)