TIGER-Lab's Collections

VisualWebInstruct

updated about 13 hours ago

Scaling up multimodal (MM) data

  • TIGER-Lab/VisualWebInstruct-Recall

    Viewer • Updated Mar 16 • 361k • 359 • 4

  • TIGER-Lab/VisualWebInstruct-Seed

    Viewer • Updated Mar 16 • 60.3k • 168 • 18

  • TIGER-Lab/VisualWebInstruct

    Viewer • Updated Aug 12 • 1.91M • 1.22k • 37

  • VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

    Paper • 2503.10582 • Published Mar 13 • 24

  • TIGER-Lab/MAmmoTH-VL2

    Image-Text-to-Text • 8B • Updated May 7 • 7 • 13

  • MAmmoTH-VL2

    Space • Running on Zero • Strong Vision Language Model trained with VisualWebInstruct

  • TIGER-Lab/VisualWebInstruct_Verified

    Viewer • Updated about 14 hours ago • 97.3k • 2 • 2