therealcyberlord/stanford-car-vit-patch16

Image Classification


therealcyberlord committed on Aug 8, 2022

Commit d96b454 • 1 Parent(s): 81f244e

Update README.md

Files changed (1)

README.md +15 -2


@@ -2,8 +2,21 @@
 license: apache-2.0
 ---
-# Vision image transformer fine-tuned on the stanford car dataset
+# ViT Fine-tuned on Stanford Car Dataset
 Base model: https://huggingface.co/google/vit-base-patch16-224
-This achieves around 82% on the testing set from my testing
+This achieves around 82% on the testing set
+Dataset Description:
+The car dataset contains 16,185 images of 196 classes of cars. The data is split into 8,144 training images and 8,041 testing images. It is a popular choice in computer vision.
+Citations:
+3D Object Representations for Fine-Grained Categorization
+Jonathan Krause, Michael Stark, Jia Deng, Li Fei-Fei
+4th IEEE Workshop on 3D Representation and Recognition, at ICCV 2013 (3dRR-13). Sydney, Australia. Dec. 8, 2013.
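
The updated README describes a ViT checkpoint fine-tuned for 196-way car classification but does not show how to run it. A minimal sketch of one way to try it follows, assuming the repository id `therealcyberlord/stanford-car-vit-patch16` from the page header and the generic `image-classification` pipeline from the `transformers` library. The names `MODEL_ID`, `top_prediction`, and `classify_car` are hypothetical helpers introduced here, and whether the checkpoint ships human-readable class labels is not confirmed by the commit.

```python
from typing import Any, Dict, List, Tuple

# Assumption: repository id taken from the page header, not from the README itself.
MODEL_ID = "therealcyberlord/stanford-car-vit-patch16"


def top_prediction(results: List[Dict[str, Any]]) -> Tuple[str, float]:
    """Return the (label, score) pair with the highest score.

    `results` is expected in the format produced by the transformers
    image-classification pipeline: a list of {"label": ..., "score": ...}.
    """
    best = max(results, key=lambda r: r["score"])
    return best["label"], float(best["score"])


def classify_car(image_path: str, model_id: str = MODEL_ID) -> Tuple[str, float]:
    """Classify one image file (hypothetical helper).

    The import is done lazily so this module loads even when transformers
    is not installed; calling this function downloads the checkpoint.
    """
    from transformers import pipeline

    classifier = pipeline("image-classification", model=model_id)
    return top_prediction(classifier(image_path))
```

Calling `classify_car("car.jpg")` with a local image path (the file name is a placeholder) would return the top label and its score. The returned labels may be generic ids such as `LABEL_0` if the checkpoint's config does not define class names.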