Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability
Language Technology Lab at Alibaba DAMO Academy
company
Collections 5
spaces 3
models 42
DAMO-NLP-SG/VideoRefer-7B-stage2.5 • Visual Question Answering • 35 • 2
DAMO-NLP-SG/VideoRefer-7B-stage2 • Visual Question Answering • 13 • 1
DAMO-NLP-SG/VideoRefer-7B • Visual Question Answering • 109 • 3
DAMO-NLP-SG/DiGIT • Unconditional Image Generation • 4
DAMO-NLP-SG/VideoLLaMA2.1-7B-AV • Visual Question Answering • 968 • 14
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F • Visual Question Answering • 2.04k • 8
DAMO-NLP-SG/VideoLLaMA2.1-7B-16F-Base • Visual Question Answering • 748 • 1
DAMO-NLP-SG/LiT-B-32_CC12M • 1
DAMO-NLP-SG/VideoLLaMA2-72B • Visual Question Answering • 79 • 10
DAMO-NLP-SG/VideoLLaMA2-72B-Base • Visual Question Answering • 24 • 1
datasets 9
DAMO-NLP-SG/multimodal_textbook • 8.57k • 113
DAMO-NLP-SG/VideoRefer-Bench • 36
DAMO-NLP-SG/CMM • 44 • 5
DAMO-NLP-SG/Multi-Source-Video-Captioning • Viewer • 1.5k • 67 • 6
DAMO-NLP-SG/LongCorpus-2.5B • Preview • 36 • 8
DAMO-NLP-SG/SOUL • Viewer • 15k • 54
DAMO-NLP-SG/MultiJail • Viewer • 315 • 48 • 6
DAMO-NLP-SG/HyperlinkMRC • 36 • 2
DAMO-NLP-SG/SSTuning-datasets • 34