microsoft/OmniParser
Tags: Image-Text-to-Text, Transformers, Safetensors, blip-2, visual-question-answering, Inference Endpoints
arXiv: 2408.00203
License: mit
3 contributors
History: 9 commits
Latest commit: adamlu1, "Upload icon_detect_model.safetensors" (61d8bf9, verified), about 1 month ago
Name                Scan  Size       Last commit                                      Updated
icon_caption_blip2  -     -          Adding `safetensors` variant of this model (#2)  about 1 month ago
icon_detect         -     -          Upload icon_detect_model.safetensors             about 1 month ago
.gitattributes      Safe  1.52 kB    initial commit                                   about 2 months ago
README.md           Safe  2.83 kB    Add metadata                                     about 1 month ago
config.json         Safe  985 Bytes  update                                           about 1 month ago