DETR-R101-DC5 object detector, finetuned on PaintSkills Dataset for visual reasoning skill evaluation of text-to-image generation models.
Please check https://github.com/j-min/DallEval/tree/main/paintskills for the instruction for running skill evaluation with the DETR model.
- Paper: DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)
- Authors: Jaemin Cho, Abhay Zala, Mohit Bansal
@inproceedings{Cho2023DallEval,
title = {DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models},
author = {Jaemin Cho and Abhay Zala and Mohit Bansal},
year = {2023},
booktitle = {ICCV},
}