Granite Vision

updated 23 days ago

Multimodal models built for visual document analysis and image understanding.

Upvote
41

  • Multimodal RAG with Granite Vision

    Running on Zero • Agents • 41

    RAG example using Granite [vision, embedding, instruct]

  • ibm-granite/granite-vision-4.1-4b

    Image-Text-to-Text • 4B • Updated 5 days ago • 54.5k • 78

  • ibm-granite/granite-vision-3.3-2b-embedding

    Feature Extraction • 3B • Updated Aug 16, 2025 • 72 • 28

  • ibm-granite/granite-vision-3.1-2b-preview

    Image-Text-to-Text • Updated Jun 12, 2025 • 1.07k • 113

  • ibm-granite/granite-vision-3.3-2b

    Image-to-Text • 3B • Updated Apr 2 • 134k • 83

  • ibm-granite/granite-4.0-3b-vision

    Image-Text-to-Text • 4B • Updated 22 days ago • 119k • 109

  • ibm-granite/granite-vision-3.2-2b

    Image-Text-to-Text • 3B • Updated Apr 2 • 5.12k • 122