Visual Question Answering
Transformers
Safetensors
English
videollama2_qwen2
text-generation
multimodal large language model
large video-language model
Inference Endpoints
VideoLLaMA2.1-7B-16F / added_tokens.json
Siheng99's picture
Upload model files.
9db092d
raw
history blame
80 Bytes
{
"<|endoftext|>": 151643,
"<|im_end|>": 151645,
"<|im_start|>": 151644
}