Skier8402's Collections

Computer vision

updated Mar 25

Image and video models

  • Runtime error
    198

    Better Florence 2

    🔥

    Analyze images to detect objects, generate captions, or perform OCR


  • Runtime error
    34

    EfficientSAM vs SAM

    ⚔


  • Running on Zero
    31

    Llava Interleave

    🌋

    Generate answers by uploading images or videos


  • Running on Zero
    1.78k

    DALLE 3 XL v2

    🔥

    Generate images from text prompts


  • Running on Zero
    129

    Segment Anything 2

    🔥

    Segment objects in images using prompts


  • Runtime error
    516

    Florence2 + SAM2

    🔥

    Segment and caption objects in images and videos


  • Running on T4
    75

    RF-DETR

    🔥

    SOTA real-time object detection model
