therealcyberlord commited on
Commit
85ebd16
1 Parent(s): 86e95ff

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -9
README.md CHANGED
@@ -8,12 +8,27 @@ metrics:
8
  - recall
9
  ---
10
 
11
- Fine-tuned ViT on the Indian Food Dataset: https://huggingface.co/datasets/bharat-raghunathan/indian-foods-dataset
12
-
13
- Evaluation metrics on the testing set (961 images):
14
-
15
- • accuracy: 0.9667
16
-
17
- precision: 0.9670
18
-
19
- • recall: 0.9667
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
  - recall
9
  ---
10
 
11
+ # Indian Food Classification with Vision Transformer (ViT)
12
+
13
+ ## Overview
14
+ This model is a fine-tuned Vision Transformer (ViT) for the task of classifying images of Indian foods. The model was trained on the [Indian Foods Dataset](https://huggingface.co/datasets/bharat-raghunathan/indian-foods-dataset) from Hugging Face Datasets.
15
+
16
+ ## Dataset
17
+ The Indian Foods Dataset contains 4,770 images across 15 different classes of popular Indian dishes. The dataset is split into:
18
+
19
+ - Training: 3,047 images
20
+ - Validation: 762 images
21
+ - Testing: 961 images
22
+
23
+ ## Model
24
+ The base model used is the vision transformer (google/vit-base-patch16-224-in21k). The model was fine-tuned on the Indian Foods Dataset for 10 epochs using the AdamW optimizer with a learning rate of 2e-4.
25
+
26
+ ## Evaluation
27
+ The model was evaluated on the test set and achieved the following metrics:
28
+
29
+ - Accuracy: 0.9667
30
+ - Precision: 0.9670
31
+ - Recall: 0.9667
32
+
33
+ ## Usage
34
+ You can use this pre-trained model directly from Hugging Face