Ying Shan

yshan2u

yshan2u's activity

authored 2 papers about 19 hours ago

BrushEdit: All-In-One Image Inpainting and Editing

Paper • 2412.10316 • Published 4 days ago • 25

ColorFlow: Retrieval-Augmented Image Sequence Colorization

Paper • 2412.11815 • Published 1 day ago • 21

authored a paper 1 day ago

FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction

Paper • 2412.09573 • Published 5 days ago • 7

authored a paper 8 days ago

Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

Paper • 2412.04432 • Published 12 days ago • 13

authored a paper 9 days ago

Moto: Latent Motion Token as the Bridging Language for Robot Manipulation

Paper • 2412.04445 • Published 12 days ago • 21

authored a paper 13 days ago

NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images

Paper • 2412.03517 • Published 13 days ago • 18

authored a paper 3 months ago

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Paper • 2409.02048 • Published Sep 3 • 3

upvoted a paper 3 months ago

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Paper • 2409.02095 • Published Sep 3 • 35

authored 2 papers 3 months ago

Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation

Paper • 2409.04410 • Published Sep 6 • 23

DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos

Paper • 2409.02095 • Published Sep 3 • 35

authored a paper 5 months ago

SEED-Story: Multimodal Long Story Generation with Large Language Model

Paper • 2407.08683 • Published Jul 11 • 22

authored a paper 8 months ago

SEED-X: Multimodal Models with Unified Multi-granularity Comprehension and Generation

Paper • 2404.14396 • Published Apr 22 • 18

authored a paper 10 months ago

Make a Cheap Scaling: A Self-Cascade Diffusion Model for Higher-Resolution Adaptation

Paper • 2402.10491 • Published Feb 16 • 17

upvoted a paper 11 months ago

Advances in 3D Generation: A Survey

Paper • 2401.17807 • Published Jan 31 • 17

authored 6 papers 11 months ago

Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities

Paper • 2401.14405 • Published Jan 25 • 11

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Paper • 2401.09047 • Published Jan 17 • 13

Towards A Better Metric for Text-to-Video Generation

Paper • 2401.07781 • Published Jan 15 • 14