Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
MBZUAI
's Collections
ArTST - Arabic Text Speech Transformer
VideoGPT+
GLaMM
Video-ChatGPT
LLaVA++ (LLaMA-3 and Phi-3-Mini)
PALO
MobiLlama
GeoChat
Satmae++
VideoGPT+
updated
Jun 11
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding
Upvote
3
MBZUAI/VideoGPT-plus_Phi3-mini-4k
Updated
Jun 17
•
6
MBZUAI/VideoGPT-plus_Phi3-mini-4k_Pretrain
Updated
Jun 17
•
1
MBZUAI/VCGBench-Diverse
Updated
Jul 1
•
462
•
3
MBZUAI/VCG-plus_112K
Viewer
•
Updated
Jun 17
•
139k
•
123
•
6
MBZUAI/video_annotation_pipeline
Viewer
•
Updated
Jun 17
•
1
•
144
•
2
MBZUAI/VideoGPT-plus_Training_Dataset
Viewer
•
Updated
Jun 6
•
576k
•
703
•
8
MBZUAI/VideoGPT-plus_Phi3-mini-4k_Ablations
Updated
Jun 13
MBZUAI/VideoGPT-plus_LLaMA3-8B-8k
Updated
Jun 13
MBZUAI/VideoGPT-plus_Vicuna-13B-4k
Updated
Jun 13
•
1
MBZUAI/VideoGPT-plus_Vicuna-7B-4k
Updated
Jun 13
Upvote
3
Share collection
View history
Collection guide
Browse collections