4 10 14

Peng Jin

Chat-UniVi

https://scholar.google.com/citations?user=HHXLexAAAAAJ&hl=en

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago

openai/gpt-oss-120b

updated a dataset 3 months ago

Chat-UniVi/browsecomp

published a dataset 3 months ago

Chat-UniVi/browsecomp

View all activity

Organizations

None yet

authored a paper 8 months ago

VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding

Paper • 2501.13106 • Published Jan 22 • 90

authored a paper 10 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 129

authored 15 papers 11 months ago

DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Paper • 2303.09867 • Published Mar 17, 2023

Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation

Paper • 2303.13399 • Published Mar 23, 2023

Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

Paper • 2303.14369 • Published Mar 25, 2023

Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment

Paper • 2305.12218 • Published May 20, 2023

Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Paper • 2311.08046 • Published Nov 14, 2023 • 2

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Paper • 2311.10122 • Published Nov 16, 2023 • 27

Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting

Paper • 2312.13271 • Published Dec 20, 2023 • 6

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Paper • 2401.15947 • Published Jan 29, 2024 • 53

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Paper • 2402.05935 • Published Feb 8, 2024 • 17

LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference

Paper • 2406.18139 • Published Jun 26, 2024 • 2

WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation

Paper • 2306.10750 • Published Jun 19, 2023

Peng Jin

AI & ML interests

Recent Activity

Organizations

Chat-UniVi's activity