DAMO-NLP-SG/VideoLLaMA2.1-7B-AV

Visual Question Answering

videollama2_qwen2

text-generation

Audio-visual Question Answering

Audio Question Answering

multimodal large language model

Inference Endpoints

VideoLLaMA2.1-7B-AV / vocab.json

阔毅

add VideoLLaMA2.1-AV model

acd1625 about 1 month ago

2.78 MB

File too large to display; see the raw version instead.