Zhixiong Zhang (SII)

rookiexiong

rookiexiong7

AI & ML interests

SJTU & SII Ph.D. Student, SII is an institution dedicated to innovation in education and research in the field of AI.

Recent Activity

upvoted a paper 3 days ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

upvoted a paper 5 days ago

UniREditBench: A Unified Reasoning-based Image Editing Benchmark

upvoted a paper 11 days ago

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

View all activity

Organizations

upvoted a paper 3 days ago

When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought

Paper • 2511.02779 • Published 5 days ago • 52

upvoted a paper 5 days ago

UniREditBench: A Unified Reasoning-based Image Editing Benchmark

Paper • 2511.01295 • Published 6 days ago • 36

upvoted a paper 11 days ago

STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence

Paper • 2510.24693 • Published 12 days ago • 18

upvoted 2 papers about 1 month ago

SPARK: Synergistic Policy And Reward Co-Evolving Framework

Paper • 2509.22624 • Published Sep 26 • 17

CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning

Paper • 2509.22647 • Published Sep 26 • 32

upvoted a paper about 2 months ago

SIM-CoT: Supervised Implicit Chain-of-Thought

Paper • 2509.20317 • Published Sep 24 • 41

authored a paper 2 months ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27 • 36

upvoted a paper 2 months ago

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27 • 36

upvoted a paper 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 255

authored a paper 3 months ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published Feb 18 • 41

upvoted a paper 3 months ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published Aug 6 • 52

authored a paper 3 months ago

GPT4Scene: Understand 3D Scenes from Videos with Vision-Language Models

Paper • 2501.01428 • Published Jan 2

upvoted 2 papers 3 months ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 130

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published Aug 1 • 62

authored a paper 4 months ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Paper • 2507.15852 • Published Jul 21 • 38

upvoted a paper 4 months ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Paper • 2507.15852 • Published Jul 21 • 38

updated a model 4 months ago

OpenIXCLab/SeC-4B

Mask Generation • 4B • Updated Jul 22 • 621 • 25

updated a dataset 4 months ago

OpenIXCLab/SeCVOS

Viewer • Updated Jul 22 • 182 • 79 • 3

liked a model 4 months ago