Create README.md
README.md
ADDED
@@ -0,0 +1,34 @@
# Color Diffusion (Evaluating Model Perception of Color Illusions in Photorealistic Scenes)

Authors: Lingjun Mao, Zineng Tang, Alane Suhr

---

![examples](https://github.com/mao1207/RCID/blob/main/images/color-diffusion.gif?raw=true)
## Model Overview

The **Color Diffusion** model, used in the paper "Evaluating Model Perception of Color Illusions in Photorealistic Scenes", generates images for the RCID dataset from a color sketch. Given a colored draft image and a text prompt, it generates a realistic image that matches both the shapes and the color patterns of the sketch. The model is built on ControlNet and was trained for 20 epochs on the MS COCO 2017 dataset.
## RCID Dataset

![RCID](https://github.com/mao1207/RCID/blob/main/images/main_figure.png?raw=true)

The construction of our dataset involves three steps:

1. **Image Generation.** For contrast and stripe illusions, we use procedural code to generate simple illusion images, which are then processed by our **Color Diffusion** model to create realistic illusion images. For filter illusions, we directly apply contrasting color filters to the original images. Each type of illusion also includes a corresponding control group without any illusions for comparison.
2. **Question Generation.** We use GPT-4o to generate image-specific questions designed to evaluate a model's understanding of the illusion.
3. **Human Feedback.** We collect human participants' feedback on these images and adjust the original classification of “illusion” and “non-illusion” based on whether participants are deceived.

Our data is available at the following link: [RCID Dataset](https://huggingface.co/datasets/mao1207/RCID)

The code is released at [https://github.com/mao1207/RCID](https://github.com/mao1207/RCID)
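Step 1 above mentions procedural code for simple illusion images. As a rough illustration only, the sketch below builds a simultaneous-contrast image in plain Python: two identical gray patches, one on a dark background and one on a light background, plus a control image with a uniform background. The function name `make_contrast_image`, the image size, the colors, and the choice of a uniform-background control are all assumptions made for this example. It is not the code used to build RCID.

```python
from typing import List, Tuple

# Hypothetical sketch of a simultaneous-contrast illusion and a control image.
# Sizes and colors are illustrative assumptions, not values from the paper.
Color = Tuple[int, int, int]
Image = List[List[Color]]


def make_contrast_image(
    left_bg: Color,
    right_bg: Color,
    patch: Color,
    width: int = 64,
    height: int = 32,
    patch_size: int = 8,
) -> Image:
    """Return an RGB grid with two backgrounds side by side,
    each holding an identical centered patch."""
    half = width // 2
    # Left half uses left_bg, right half uses right_bg.
    image = [
        [left_bg if x < half else right_bg for x in range(width)]
        for _ in range(height)
    ]
    top = (height - patch_size) // 2
    # Paint the same patch color at the center of each half.
    for center_x in (half // 2, half + half // 2):
        left = center_x - patch_size // 2
        for y in range(top, top + patch_size):
            for x in range(left, left + patch_size):
                image[y][x] = patch
    return image


GRAY = (128, 128, 128)
DARK = (30, 30, 30)
LIGHT = (225, 225, 225)

# Illusion: identical gray patches on contrasting backgrounds.
illusion = make_contrast_image(DARK, LIGHT, GRAY)
# Assumed control: the same patches on one uniform background.
control = make_contrast_image(DARK, DARK, GRAY)
```

In both images the two patches have exactly the same RGB value; only the surrounding background differs. An image like this would serve as the simple input that a sketch-conditioned model then turns into a realistic scene.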
## License

The source code of this repository is released under the Apache License 2.0. The model license and dataset license are listed on their corresponding webpages.

For more information, to access the dataset, or to contribute, please visit our [Website](https://color-illusion.github.io/Color-Illusion/).