Apply for community grant: Personal project (gpu)

#1
by revp2024 - opened
Owner
โ€ข
edited May 28

ReVP: Reversible Visual Processing with Latent Models

TL;DR, Our results on images in-the-wild. Our method, ReVP, can hide an image inside another image. Consider various irreversible image processing operations, such as censorship and AI editing, we hide each original image in its processed version. Checkout our project page for more info!


As large pretrained models gain traction, we find that large pretrained autoencoders (AEs) are surprisingly suitable for image steganography. Unlike previous methods, our model performs message reconstruction in the latent space of an AE, enabling effortless training of a pair of steganographic encoder-decoder using a simple L2 loss. With a small number of additional tunable parameters (around 10% of the base AE), our method reconstructs intricate high-frequency details. To further examine our method in practical scenarios, we test it in two novel applications: reversible visual censorship and reversible AI editing. Additionally, we provide modules to enhance our model's robustness to cropping, quantization, and compression. Our method is straightforward and easy to implement. Despite its simplicity, it works effectively on both in-the-wild images and standard datasets across various applications, offering competitive perceptual quality compared to specialized models.

Hi @revp2024 , we've assigned ZeroGPU to this Space. Please check the compatibility and usage sections of this page so your Space can run on ZeroGPU.

Sign up or log in to comment