Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Yizhi Song's picture
6 2

Yizhi Song

song630
·

AI & ML interests

GenAI

Recent Activity

upvoted a paper 11 days ago
Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination
upvoted a paper about 2 months ago
Building a Foundational Guardrail for General Agentic Systems via Synthetic Data
upvoted a paper 2 months ago
Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models
View all activity

Organizations

None yet

upvoted a paper 11 days ago

Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination

Paper • 2511.17490 • Published 15 days ago • 21
upvoted a paper about 2 months ago

Building a Foundational Guardrail for General Agentic Systems via Synthetic Data

Paper • 2510.09781 • Published Oct 10 • 26
upvoted a paper 2 months ago

Video-LMM Post-Training: A Deep Dive into Video Reasoning with Large Multimodal Models

Paper • 2510.05034 • Published Oct 6 • 48
upvoted a paper 8 months ago

Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting

Paper • 2504.05541 • Published Apr 7 • 15
upvoted a paper 9 months ago

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 96
upvoted a paper 11 months ago

Generative AI for Cel-Animation: A Survey

Paper • 2501.06250 • Published Jan 8 • 12
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs