Model Card for AVA Image Clip Embeddings
The AVA image dataset is a collection of digital photos with ratings. It was used to create the visual scorer that evaluated the images in LAION-5B to create the LAION-Aesthetics dataset.
https://github.com/imfing/ava_downloader/
“AVA: A Large-Scale Database for Aesthetic Visual Analysis”.
Naila Murray, Luca Marchesotti, Florent Perronnin, CVPR 2012.
New aesthetics scorer: https://github.com/kenjiqq/aesthetics-scorer/
Original aesthetics scorer: https://github.com/christophschuhmann/improved-aesthetic-predictor/
The AVA images were processed with the following OpenCLIP bigG-14, H-14, and L-14 models:
"laion/CLIP-ViT-bigG-14-laion2B-39B-b160k"
"laion/CLIP-ViT-H-14-laion2B-s32B-b79K"
"laion/CLIP-ViT-L-14-laion2B-s32B-b82K"
https://github.com/mlfoundations/open_clip
Not all images were processed!
Refer to the parquet file for the list of successfully processed images.
The parquet fields are:
- "image_name" (same ID as in the AVA CSV)
- "pooled_output"
- "projected_embedding"
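
Because only some AVA images have embeddings, it can help to check which IDs are present before joining against the AVA ratings. The following is a minimal sketch, not the method used to build this dataset. It assumes that each embedding cell is a flat sequence of floats and that "image_name" matches the AVA ID once both are cast to strings. The helper names (`load_embeddings`, `split_processed`, `embedding_matrix`) and the file name in the closing comment are hypothetical.

```python
import numpy as np
import pandas as pd


def load_embeddings(path: str) -> pd.DataFrame:
    """Read the embeddings parquet file into a DataFrame."""
    return pd.read_parquet(path)


def split_processed(ava_ids, embeddings: pd.DataFrame):
    """Split AVA image IDs into those with and without an embedding row.

    Assumption: IDs compare equal after casting both sides to str.
    """
    processed = set(embeddings["image_name"].astype(str))
    found = [i for i in ava_ids if str(i) in processed]
    missing = [i for i in ava_ids if str(i) not in processed]
    return found, missing


def embedding_matrix(embeddings: pd.DataFrame,
                     column: str = "projected_embedding") -> np.ndarray:
    """Stack one embedding column into a 2-D float32 array.

    Assumption: every cell in `column` is a flat sequence of floats of the
    same length. Rows follow the DataFrame order.
    """
    return np.stack([np.asarray(v, dtype=np.float32) for v in embeddings[column]])


# Example usage (hypothetical file name; substitute the actual parquet file):
#   df = load_embeddings("ava_clip_embeddings.parquet")
#   found, missing = split_processed(ava_ids_from_csv, df)
#   X = embedding_matrix(df, "projected_embedding")
```

The returned matrix can then be paired row by row with `df["image_name"]` when looking up ratings in the AVA CSV.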