5 27 71

Maitreya Patel

mpatel57

https://maitreyapatel.com

AI & ML interests

Text to Image Generative Models, Vision-Language Representation Learning

Recent Activity

liked a dataset 5 days ago

tiange/Cap3D

liked a Space 7 days ago

Intel/UnlearnDiffAtk-Benchmark

upvoted a paper 8 days ago

An Empirical Study of Autoregressive Pre-training from Videos

View all activity

Organizations

mpatel57's activity

upvoted a paper 8 days ago

An Empirical Study of Autoregressive Pre-training from Videos

Paper • 2501.05453 • Published 9 days ago • 36

upvoted 2 papers about 2 months ago

Steering Rectified Flow Models in the Vector Field for Controlled Image Generation

Paper • 2412.00100 • Published Nov 27, 2024 • 16

TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives

Paper • 2411.02545 • Published Nov 4, 2024 • 1

upvoted 3 collections about 2 months ago

upvoted a paper 4 months ago

Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing

Paper • 2409.01322 • Published Sep 2, 2024 • 95

upvoted a paper 5 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 156

upvoted 4 papers 6 months ago

TurboEdit: Text-Based Image Editing Using Few-Step Diffusion Models

Paper • 2408.00735 • Published Aug 1, 2024 • 16

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement

Paper • 2408.00653 • Published Aug 1, 2024 • 29

Tora: Trajectory-oriented Diffusion Transformer for Video Generation

Paper • 2407.21705 • Published Jul 31, 2024 • 27

FreeLong: Training-Free Long Video Generation with SpectralBlend Temporal Attention

Paper • 2407.19918 • Published Jul 29, 2024 • 49

upvoted 2 papers 7 months ago

An Image is Worth 32 Tokens for Reconstruction and Generation

Paper • 2406.07550 • Published Jun 11, 2024 • 57

Zero-shot Image Editing with Reference Imitation

Paper • 2406.07547 • Published Jun 11, 2024 • 32

upvoted 2 papers 10 months ago

Bigger is not Always Better: Scaling Properties of Latent Diffusion Models

Paper • 2404.01367 • Published Apr 1, 2024 • 21

Streaming Dense Video Captioning

Paper • 2404.01297 • Published Apr 1, 2024 • 12

upvoted a collection 10 months ago

Representative Papers

Collection

Collection of research papers published by the organization members • 4 items • Updated Mar 30, 2024 • 1

upvoted a collection 11 months ago

ECLIPSE Series Priors

Collection

ECLIPSE priors for kandinsky v2.2 for T2I and Personalized T2I. • 3 items • Updated Apr 12, 2024 • 1

upvoted a paper 11 months ago

Magic-Me: Identity-Specific Video Customized Diffusion

Paper • 2402.09368 • Published Feb 14, 2024 • 28

upvoted a collection 11 months ago

ECLIPSE Stack

Collection

5 items • Updated Aug 7, 2024 • 1