Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Aliayub1995
/
VideoLLaMA2-7B
like
1
Visual Question Answering
Transformers
Safetensors
OpenGVLab/VideoChat2-IT
Lin-Chen/ShareGPT4V
liuhaotian/LLaVA-Instruct-150K
English
videollama2_mistral
text-generation
multimodal large language model
large video-language model
Inference Endpoints
arxiv:
2406.07476
arxiv:
2306.02858
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
a0a5594
VideoLLaMA2-7B
/
videollama2
/
model
2 contributors
History:
1 commit
Aliayub1995
Upload 52 files
87ce8f2
verified
4 months ago
__init__.py
Safe
11.6 kB
Upload 52 files
4 months ago
encoder.py
Safe
6.09 kB
Upload 52 files
4 months ago
projector.py
Safe
8.83 kB
Upload 52 files
4 months ago
videollama2_arch.py
Safe
13 kB
Upload 52 files
4 months ago
videollama2_gemma2.py
Safe
6.25 kB
Upload 52 files
4 months ago
videollama2_llama.py
Safe
5.46 kB
Upload 52 files
4 months ago
videollama2_mistral.py
Safe
5.55 kB
Upload 52 files
4 months ago
videollama2_mixtral.py
Safe
5.38 kB
Upload 52 files
4 months ago
videollama2_phi3.py
Safe
5.49 kB
Upload 52 files
4 months ago
videollama2_qwen2.py
Safe
5.4 kB
Upload 52 files
4 months ago