therealcyberlord commited on
Commit
d96b454
1 Parent(s): 81f244e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -2
README.md CHANGED
@@ -2,8 +2,21 @@
2
  license: apache-2.0
3
  ---
4
 
5
- # Vision image transformer fine-tuned on the stanford car dataset
6
 
7
  Base model: https://huggingface.co/google/vit-base-patch16-224
8
 
9
- This achieves around 82% on the testing set from my testing
 
 
 
 
 
 
 
 
 
 
 
 
 
 
2
  license: apache-2.0
3
  ---
4
 
5
+ # ViT Fine-tuned on Stanford Car Dataset
6
 
7
  Base model: https://huggingface.co/google/vit-base-patch16-224
8
 
9
+ This achieves around 82% on the testing set
10
+
11
+
12
+ Dataset Description:
13
+
14
+ The car dataset contains 16,185 images of 196 classes of cars. The data is split into 8,144 training images and 8,041 testing images. It is a popular choice in computer vision.
15
+
16
+
17
+
18
+ Citations:
19
+
20
+ 3D Object Representations for Fine-Grained Categorization
21
+ Jonathan Krause, Michael Stark, Jia Deng, Li Fei-Fei
22
+ 4th IEEE Workshop on 3D Representation and Recognition, at ICCV 2013 (3dRR-13). Sydney, Australia. Dec. 8, 2013.