mao1207 commited on
Commit
8c3b3b3
·
verified ·
1 Parent(s): 9e0ca58

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Color Diffusion (Evaluating Model Perception of Color Illusions in Photorealistic Scenes)
2
+
3
+ Authors: Lingjun Mao, Zineng Tang, Alane Suhr
4
+
5
+ ---
6
+
7
+ ![examples](https://github.com/mao1207/RCID/blob/main/images/color-diffusion.gif?raw=true)
8
+
9
+
10
+ ## Model Overview
11
+
12
+ The **Color Diffusion** model used in the paper "Evaluating Model Perception of Color Illusions in Photorealistic Scenes" is designed to generate images for RCID dataset based on a color sketch. By simply providing the model with a colored draft image, it can generate realistic images that match both the shape and color patterns of the provided sketch, according to a given text prompt. This model is built upon ControlNet and has been trained for 20 epochs on the MS COCO 2017 dataset.
13
+
14
+ ## RCID Dataset
15
+
16
+ ![RCID](https://github.com/mao1207/RCID/blob/main/images/main_figure.png?raw=true)
17
+
18
+ The construction of our dataset involves three steps:
19
+
20
+ 1. **Image Generation.** For contrast and stripe illusions, we use procedural code to generate simple illusion images, which are then processed by our **Color Diffusion** model to create realistic illusion images. For filter illusions, we directly apply contrasting color filters to the original images. Each type of illusion also includes a corresponding control group without any illusions for comparison.
21
+
22
+ 2. **Question Generation.** We use GPT-4o to generate image-specific questions that are designed to evaluate the model's understanding of the illusion.
23
+
24
+ 3. **Human Feedback.** We collect human participants' feedback on these images and adjust the original classification of “illusion” and “non-illusion” based on whether participants are deceived.
25
+
26
+ Our data can be found in the following link: [RCID Dataset](https://huggingface.co/datasets/mao1207/RCID)
27
+
28
+ The code is released on [https://github.com/mao1207/RCID](https://huggingface.co/datasets/mao1207/RCID)
29
+
30
+ ## License
31
+
32
+ The source code of this repository is released under the Apache License 2.0. The model license and dataset license are listed on their corresponding webpages.
33
+
34
+ For more information, access to the dataset, and to contribute, please visit our [Website](https://color-illusion.github.io/Color-Illusion/).