Pathways on the Image Manifold: Image Editing via Video Generation
Abstract
Image editing driven by image diffusion models has progressed rapidly. Significant challenges remain, however: these models often struggle to follow complex edit instructions accurately, and they frequently compromise fidelity by altering key elements of the original image. Meanwhile, video generation has also advanced markedly, with models that effectively function as consistent and continuous world simulators. In this paper, we propose merging these two fields by using image-to-video models for image editing. We reformulate image editing as a temporal process, using pretrained video models to create smooth transitions from the original image to the desired edit. This approach traverses the image manifold continuously, which keeps edits consistent while preserving the original image's key aspects. Our approach achieves state-of-the-art results on text-based image editing, with significant improvements in both edit accuracy and image preservation.
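To make the "editing as a temporal process" idea concrete, here is a minimal Python sketch of the control flow only. It is not the paper's implementation: `toy_video_model` is a hypothetical stand-in for a pretrained image-to-video model, images are toy lists of pixel intensities, and taking the last frame as the edited image is an assumption made for illustration, since the abstract does not say how the final frame is chosen.

```python
from typing import Callable, List, Tuple

Image = List[float]  # toy stand-in: a flat list of pixel intensities in [0, 1]
VideoModel = Callable[[Image, str, int], List[Image]]


def toy_video_model(first_frame: Image, prompt: str, num_frames: int) -> List[Image]:
    """Hypothetical stand-in for a pretrained image-to-video model.

    It ignores the prompt and brightens the image slightly on each frame,
    so the clip starts at the input image and changes gradually.
    """
    return [
        [min(1.0, pixel + 0.05 * t) for pixel in first_frame]
        for t in range(num_frames)
    ]


def edit_via_video(
    image: Image,
    instruction: str,
    video_model: VideoModel,
    num_frames: int = 8,
) -> Tuple[Image, List[Image]]:
    """Treat the edit as a short clip that starts at the source image.

    Assumption for illustration: the edited image is the last frame.
    """
    frames = video_model(image, instruction, num_frames)
    return frames[-1], frames


def max_step(frames: List[Image]) -> float:
    """Largest per-pixel change between consecutive frames (crude smoothness check)."""
    return max(
        abs(a - b)
        for prev, nxt in zip(frames, frames[1:])
        for a, b in zip(prev, nxt)
    )


source = [0.1, 0.5, 0.9]
edited, path = edit_via_video(source, "make the scene brighter", toy_video_model, num_frames=4)
print("first frame equals source:", path[0] == source)
print("largest change between consecutive frames:", max_step(path))
```

In this toy setting the first frame is the unmodified source and each subsequent frame differs only slightly from the previous one, which mirrors the abstract's claim that a continuous path helps preserve the original image. A real pipeline would replace `toy_video_model` with an actual image-to-video model.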
Community
TLDR: By traversing the image manifold using video generation models, we redefine image editing as a smooth temporal progression, achieving state-of-the-art edits while preserving the original image's defining characteristics.
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API:
- SeedEdit: Align Image Re-Generation to Image Editing (2024)
- Vision-guided and Mask-enhanced Adaptive Denoising for Prompt-based Image Editing (2024)
- Transforming Static Images Using Generative Models for Video Salient Object Detection (2024)
- ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models (2024)
- TrojanEdit: Backdooring Text-Based Image Editing Models (2024)
- Stable Flow: Vital Layers for Training-Free Image Editing (2024)
- Multi-Reward as Condition for Instruction-based Image Editing (2024)