XiaomiMiMo/MiMo-VL-7B-SFT
Image-Text-to-Text
Transformers
Safetensors
qwen2_5_vl
image-to-text
conversational
text-generation-inference
arxiv: 2506.03569
License: mit
1.52 kB
3 contributors
History: 1 commit
bwshen-mi: initial commit (857b7ef, verified), 7 months ago
.gitattributes, 1.52 kB, initial commit, 7 months ago