Eval Results
markweber commited on
Commit
345a906
·
verified ·
1 Parent(s): d994cf6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +36 -3
README.md CHANGED
@@ -1,3 +1,36 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - ILSVRC/imagenet-1k
5
+ model-index:
6
+ - name: Taming-VQGAN
7
+ results:
8
+ - task:
9
+ type: image-generation
10
+ dataset:
11
+ name: ILSVRC/imagenet-1k
12
+ type: ILSVRC/imagenet-1k
13
+ metrics:
14
+ - name: rFID
15
+ type: rFID
16
+ value: 7.96
17
+ - name: InceptionScore
18
+ type: InceptionScore
19
+ value: 115.9
20
+ - name: LPIPS
21
+ type: LPIPS
22
+ value: 0.306
23
+ - name: PSNR
24
+ type: PSNR
25
+ value: 20.2
26
+ - name: SSIM
27
+ type: SSIM
28
+ value: 0.52
29
+ - name: CodebookUsage
30
+ type: CodebookUsage
31
+ value: 0.445
32
+ ---
33
+
34
+ This model is the Taming VQGAN tokenizer with a vocabulary size of 10bits converted into a format for the MaskBit codebase. It uses a downsampling factor of 16 and is trained on ImageNet for images of resolution 256.
35
+
36
+ You can find more details on the VQGAN in the original [repository](https://github.com/CompVis/taming-transformers) or [paper](https://arxiv.org/abs/2012.09841). All credits for this model belong to Patrick Esser, Robin Rombach and Björn Ommer.