fulong ye
Alon77777
AI & ML interests
vision and language, diffusion model, text-to-image generation, image-to-text generation, referring expression generation and comprehension
Recent Activity
upvoted
a
paper
21 days ago
OmniInsert: Mask-Free Video Insertion of Any Reference via Diffusion
Transformer Models
upvoted
a
paper
about 1 month ago
UMO: Scaling Multi-Identity Consistency for Image Customization via
Matching Reward
upvoted
a
paper
about 2 months ago
USO: Unified Style and Subject-Driven Generation via Disentangled and
Reward Learning