Spaces:

sohojoe
/

soho-clip-embeddings-explorer

Running

App Files Files Community

Joe Booth commited on Feb 9, 2023

Commit

8498cb9

1 Parent(s): 5bbcb1c

clean up info text

Browse files

Files changed (1) hide show

app.py +8 -20

app.py CHANGED Viewed

@@ -73,10 +73,7 @@ def base64_to_embedding(embeddings_b64):
 def main(
     # input_im,
     embeddings,
-    scale=3.0,
     n_samples=4,
-    steps=25,
-    seed=None
     ):
     embeddings = base64_to_embedding(embeddings)
@@ -293,9 +290,9 @@ with gr.Blocks() as demo:
         with gr.Column(scale=5):
             gr.Markdown(
 """
-# Soho-Clip Embedding Explorer
-A tool for exploring CLIP embedding spaces.
 Try uploading a few images and/or add some text prompts and click generate images.
 """)
@@ -365,16 +362,10 @@ Try uploading a few images and/or add some text prompts and click generate image
         with gr.Accordion(f"Avergage embeddings in base 64", open=False):
             average_embedding_base64 = gr.Textbox(show_label=False)
     with gr.Row():
-        submit = gr.Button("Search embedding space")
-    with gr.Row():
-        with gr.Column(scale=1, min_width=200):
-            scale = gr.Slider(0, 25, value=3, step=1, label="Guidance scale")
         with gr.Column(scale=1, min_width=200):
             n_samples = gr.Slider(1, 16, value=4, step=1, label="Number images")
-        with gr.Column(scale=1, min_width=200):
-            steps = gr.Slider(5, 50, value=25, step=5, label="Steps")
-        with gr.Column(scale=1, min_width=200):
-            seed = gr.Number(None, label="Seed (blank = random)", precision=0)
     with gr.Row():
         output = gr.Gallery(label="Generated variations")
@@ -391,7 +382,7 @@ Try uploading a few images and/or add some text prompts and click generate image
     average_embedding_base64.change(on_embeddings_changed_update_plot, average_embedding_base64, average_embedding_plot)
     # submit.click(main, inputs= [embedding_base64s[0], scale, n_samples, steps, seed], outputs=output)
-    submit.click(main, inputs= [average_embedding_base64, scale, n_samples, steps, seed], outputs=output)
     output.style(grid=2)
     with gr.Row():
@@ -403,18 +394,15 @@ My interest is to use CLIP for image/video understanding (see [CLIP_visual-spati
 ### Initial Features
 - Combine up to 10 Images and/or text inputs to create an average embedding space.
-- View embedding spaces as graph
-- Generate a new image based on the average embedding space
 ### Known limitations
-- Text input is a little off (requires fine tuning and I'm having issues with that at the moment)
-- It can only generate a single image at a time
-- Not easy to use the sample images
 ### Acknowledgements
-- I heavily build on Justin Pinkney's [Experiments in Image Variation](https://www.justinpinkney.com/image-variation-experiments). Please credit them if you use this work.
 - [CLIP](https://openai.com/blog/clip/)
 - [Stable Diffusion](https://github.com/CompVis/stable-diffusion)

 def main(
     # input_im,
     embeddings,
     n_samples=4,
     ):
     embeddings = base64_to_embedding(embeddings)
         with gr.Column(scale=5):
             gr.Markdown(
 """
+# Soho-Clip Embeddings Explorer
+A tool for exploring CLIP embedding space.
 Try uploading a few images and/or add some text prompts and click generate images.
 """)
         with gr.Accordion(f"Avergage embeddings in base 64", open=False):
             average_embedding_base64 = gr.Textbox(show_label=False)
     with gr.Row():
         with gr.Column(scale=1, min_width=200):
             n_samples = gr.Slider(1, 16, value=4, step=1, label="Number images")
+        with gr.Column(scale=3, min_width=200):
+            submit = gr.Button("Search embedding space")
     with gr.Row():
         output = gr.Gallery(label="Generated variations")
     average_embedding_base64.change(on_embeddings_changed_update_plot, average_embedding_base64, average_embedding_plot)
     # submit.click(main, inputs= [embedding_base64s[0], scale, n_samples, steps, seed], outputs=output)
+    submit.click(main, inputs= [average_embedding_base64, n_samples], outputs=output)
     output.style(grid=2)
     with gr.Row():
 ### Initial Features
 - Combine up to 10 Images and/or text inputs to create an average embedding space.
+- Search the laion 5b immages via a knn search
 ### Known limitations
+- ...
 ### Acknowledgements
+- I heavily build on [clip-retrieval](https://rom1504.github.io/clip-retrieval/) and use their API. Please [citate](https://github.com/rom1504/clip-retrieval#citation) the authors if you use this work.
 - [CLIP](https://openai.com/blog/clip/)
 - [Stable Diffusion](https://github.com/CompVis/stable-diffusion)