DAMO-NLP-SG/VideoLLaMA2.1-7B-AV
Language Technology Lab at Alibaba DAMO Academy
Visual Question Answering
Transformers
Safetensors
lmms-lab/ClothoAQA
Loie/VGGSound
English
videollama2_qwen2
text-generation
Audio-visual Question Answering
Audio Question Answering
multimodal large language model
Inference Endpoints
arxiv: 2406.07476
arxiv: 2306.02858
License: apache-2.0
VideoLLaMA2.1-7B-AV/README.md (branch: main)
Commit History
Update README.md (d944d42, verified): YifeiXin committed about 1 month ago
Update README.md (b9c58e1, verified): lixin4ever committed on Oct 22
Update README.md (fba52ca, verified): lixin4ever committed on Oct 22
Update README.md (4c84984, verified): lixin4ever committed on Oct 22
Update README.md (c7e14fd, verified): YifeiXin committed on Oct 22
initial commit (eacaf06, verified): YifeiXin committed on Oct 21