ggbond's picture

2 8 3

ggbond

zhangzixin02

·

AI & ML interests

None yet

Recent Activity

authored a paper about 10 hours ago

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

upvoted a paper 6 days ago

Accelerating Streaming Video Large Language Models via Hierarchical Token Compression

upvoted a paper 6 days ago

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

View all activity

Organizations

None yet

upvoted 2 papers 6 days ago

Accelerating Streaming Video Large Language Models via Hierarchical Token Compression

Paper • 2512.00891 • Published 8 days ago • 14

DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation

Paper • 2511.23127 • Published 10 days ago • 42

upvoted a paper 21 days ago

TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models

Paper • 2511.13704 • Published 21 days ago • 42

upvoted a paper about 1 month ago

Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks

Paper • 2510.25760 • Published Oct 29 • 16

upvoted 2 papers about 2 months ago

How to Teach Large Multimodal Models New Skills

Paper • 2510.08564 • Published Oct 9 • 2

PhysToolBench: Benchmarking Physical Tool Understanding for MLLMs

Paper • 2510.09507 • Published Oct 10 • 10

upvoted a paper 2 months ago

LongLive: Real-time Interactive Long Video Generation

Paper • 2509.22622 • Published Sep 26 • 184

upvoted a paper 8 months ago

DiMeR: Disentangled Mesh Reconstruction Model

Paper • 2504.17670 • Published Apr 24 • 24