Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
kpyu
/
video-blip-flan-t5-xl-ego4d
like
3
Image-to-Text
Transformers
PyTorch
English
blip-2
text2text-generation
vision
video-to-text
image-captioning
video-captioning
visual-question-answering
Inference Endpoints
arxiv:
2301.12597
arxiv:
2210.11416
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Adding `safetensors` variant of this model
#1 opened 11 months ago by
SFconvertbot