Visual Question Answering
Transformers
English
videollama2_qwen2
text-generation
multimodal large language model
large video-language model
Inference Endpoints

Commit History

Update README.md
d3289bb
verified

lixin4ever commited on

Update the results of VideoLLaMA2.1
ec13b66
verified

lixin4ever commited on

Upload projector model files.
3195661

Siheng99 commited on

initial commit
4e2c694
verified

ClownRat commited on