ImagenHub

community

https://tiger-ai-lab.github.io/ImagenHub/

Activity Feed Request to join this org

AI & ML interests

Multimedia

Recent Activity

vinesmsuic authored a paper 3 days ago

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

drogozhang authored a paper 8 days ago

Agent Learning via Early Experience

wenhu authored a paper 19 days ago

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

View all activity

vinesmsuic

authored a paper 3 days ago

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Paper • 2509.26346 • Published 18 days ago • 17

drogozhang

authored a paper 8 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published 9 days ago • 223

wenhu

authored a paper 19 days ago

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published 22 days ago • 66

drogozhang

authored a paper about 2 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31 • 83

wenhu

authored 3 papers 5 months ago

VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation

Paper • 2506.03930 • Published Jun 4 • 26

Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem

Paper • 2506.03295 • Published Jun 3 • 17

StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs

Paper • 2505.20139 • Published May 26 • 19

wenhu

authored 5 papers 7 months ago

ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations

Paper • 2504.00824 • Published Apr 1 • 43

wenhu

authored a paper 8 months ago

ABC: Achieving Better Control of Multimodal Embeddings using VLMs

Paper • 2503.00329 • Published Mar 1 • 19

vinesmsuic

authored a paper 8 months ago

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published Feb 26 • 48

Fiaa

authored a paper 9 months ago

ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding

Paper • 2501.05452 • Published Jan 9 • 15

wenhu

authored a paper 10 months ago

MAmmoTH-VL: Eliciting Multimodal Reasoning with Instruction Tuning at Scale

Paper • 2412.05237 • Published Dec 6, 2024 • 46

wenhu

authored a paper 11 months ago

VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation

Paper • 2412.00927 • Published Dec 1, 2024 • 29

Anwen0809

authored a paper 12 months ago

Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning

Paper • 2408.08640 • Published Aug 16, 2024 • 3

drogozhang

authored a paper 12 months ago

Long-LRM: Long-sequence Large Reconstruction Model for Wide-coverage Gaussian Splats

Paper • 2410.12781 • Published Oct 16, 2024 • 6

wenhu

authored a paper about 1 year ago

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 31

AI & ML interests

Recent Activity

Team members 7

ImagenHub's activity