metadata

tags:
  - autotrain
  - image-classification
widget:
  - src: >-
      https://huggingface.co/datasets/mishig/sample_images/resolve/main/tiger.jpg
    example_title: Tiger
  - src: >-
      https://huggingface.co/datasets/mishig/sample_images/resolve/main/teapot.jpg
    example_title: Teapot
  - src: >-
      https://huggingface.co/datasets/mishig/sample_images/resolve/main/palace.jpg
    example_title: Palace
datasets:
  - Colby/autotrain-data-sdxl-detection
license: cc-by-nc-3.0

SDXL Detector

This model was created by fine-tuning the umm-maybe AI art detector on a dataset of Wikimedia-SDXL image pairs, where the SDXL image is generated using a prompt based upon a BLIP-generated caption describing the Wikimedia image.

This model demonstrates greatly improved performance over the umm-maybe detector on images generated by more recent diffusion models as well as non-artistic imagery (given the broader range of subjects depicted in the random sample drawn from Wikimedia).

However, its performance may be lower for images generated using models other than SDXL. In particular, this model underperforms the original detector for images generated using older models (such as VQGAN+CLIP).

The data used for this fine-tune is either synthetic (generated by SDXL) and therefore non-copyrightable, or downloaded from Wikimedia and therefore meeting their definition of "free data" (see https://commons.wikimedia.org/wiki/Commons:Licensing for details). However, the original umm-maybe AI art detector was trained on data scraped from image links in Reddit posts, some of which may be copyrighted. Therefore this model as well as its predecessor should be considered appropriate for non-commercial (i.e. personal or educational) fair uses only.

Model Trained Using AutoTrain

Problem type: Image Classification

Validation Metrics

loss: 0.08717025071382523

f1: 0.9732620320855615

precision: 0.994535519125683

recall: 0.9528795811518325

auc: 0.9980461893059392

accuracy: 0.9812734082397003