nvidia/NVLM-D-72B
Image-Text-to-Text
Transformers
Safetensors
PyTorch
English
NVLM_D
nvidia
NVLM
multimodal
conversational
custom_code
Inference Endpoints
arxiv:2409.11402
License: cc-by-nc-4.0
NVLM-D-72B/eval/requirements.txt (branch: main)
boxinw@nvidia.com
Add benchmark evaluation scripts
b925209
23 days ago
28 Bytes
anls
datasets
pycocoevalcap