An Empirical Study of Autoregressive Pre-training from Videos Paper • 2501.05453 • Published 9 days ago • 36
Steering Rectified Flow Models in the Vector Field for Controlled Image Generation Paper • 2412.00100 • Published Nov 27, 2024 • 16
TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives Paper • 2411.02545 • Published Nov 4, 2024 • 1
Benchmark Checkpoints Collection Weights of TripletCLIP and baselines, trained with custom training scripts • 9 items • Updated Dec 1, 2024 • 2
FlowChef Collection Steering Rectified Flow Models in the Vector Field for Controlled Image Generation • 3 items • Updated Nov 30, 2024 • 1
Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing Paper • 2409.01322 • Published Sep 2, 2024 • 95
Transformer Explainer: Interactive Learning of Text-Generative Models Paper • 2408.04619 • Published Aug 8, 2024 • 156
TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models Paper • 2408.00735 • Published Aug 1, 2024 • 16
SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement Paper • 2408.00653 • Published Aug 1, 2024 • 29
Tora: Trajectory-oriented Diffusion Transformer for Video Generation Paper • 2407.21705 • Published Jul 31, 2024 • 27
FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention Paper • 2407.19918 • Published Jul 29, 2024 • 49
An Image is Worth 32 Tokens for Reconstruction and Generation Paper • 2406.07550 • Published Jun 11, 2024 • 57
Bigger is not Always Better: Scaling Properties of Latent Diffusion Models Paper • 2404.01367 • Published Apr 1, 2024 • 21
Representative Papers Collection Collection of research papers published by the organization members • 4 items • Updated Mar 30, 2024 • 1
ECLIPSE Series Priors Collection ECLIPSE priors for Kandinsky v2.2 for T2I and Personalized T2I. • 3 items • Updated Apr 12, 2024 • 1
Magic-Me: Identity-Specific Video Customized Diffusion Paper • 2402.09368 • Published Feb 14, 2024 • 28