nathanReitinger commited on
Commit
b00b9f1
1 Parent(s): 71b5af2

End of training

Browse files
Files changed (1) hide show
  1. README.md +26 -16
README.md CHANGED
@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [google/vit-huge-patch14-224-in21k](https://huggingface.co/google/vit-huge-patch14-224-in21k) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 0.2821
21
- - Accuracy: 0.9034
22
 
23
  ## Model description
24
 
@@ -38,30 +38,40 @@ More information needed
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 5e-05
41
- - train_batch_size: 16
42
- - eval_batch_size: 16
43
  - seed: 42
44
  - gradient_accumulation_steps: 4
45
- - total_train_batch_size: 64
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
  - lr_scheduler_warmup_ratio: 0.1
49
- - num_epochs: 10
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
55
- | 0.5334 | 1.0 | 750 | 0.5595 | 0.8302 |
56
- | 0.3389 | 2.0 | 1500 | 0.4090 | 0.8602 |
57
- | 0.358 | 3.0 | 2250 | 0.3631 | 0.8717 |
58
- | 0.3672 | 4.0 | 3000 | 0.3368 | 0.8815 |
59
- | 0.3458 | 5.0 | 3750 | 0.3231 | 0.8842 |
60
- | 0.2721 | 6.0 | 4500 | 0.3075 | 0.8885 |
61
- | 0.2397 | 7.0 | 5250 | 0.3035 | 0.8899 |
62
- | 0.2779 | 8.0 | 6000 | 0.2893 | 0.8963 |
63
- | 0.2046 | 9.0 | 6750 | 0.2868 | 0.8991 |
64
- | 0.2599 | 10.0 | 7500 | 0.2821 | 0.9034 |
 
 
 
 
 
 
 
 
 
 
65
 
66
 
67
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [google/vit-huge-patch14-224-in21k](https://huggingface.co/google/vit-huge-patch14-224-in21k) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.2854
21
+ - Accuracy: 0.9055
22
 
23
  ## Model description
24
 
 
38
 
39
  The following hyperparameters were used during training:
40
  - learning_rate: 5e-05
41
+ - train_batch_size: 32
42
+ - eval_batch_size: 32
43
  - seed: 42
44
  - gradient_accumulation_steps: 4
45
+ - total_train_batch_size: 128
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
  - lr_scheduler_warmup_ratio: 0.1
49
+ - num_epochs: 20
50
 
51
  ### Training results
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
54
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
55
+ | 0.7694 | 1.0 | 375 | 0.7344 | 0.8202 |
56
+ | 0.4737 | 2.0 | 750 | 0.5042 | 0.8457 |
57
+ | 0.4318 | 3.0 | 1125 | 0.3876 | 0.8702 |
58
+ | 0.3395 | 4.0 | 1500 | 0.3688 | 0.8769 |
59
+ | 0.3105 | 5.0 | 1875 | 0.3357 | 0.8845 |
60
+ | 0.2742 | 6.0 | 2250 | 0.3272 | 0.883 |
61
+ | 0.2898 | 7.0 | 2625 | 0.3156 | 0.8903 |
62
+ | 0.2774 | 8.0 | 3000 | 0.3004 | 0.8937 |
63
+ | 0.2833 | 9.0 | 3375 | 0.2976 | 0.8933 |
64
+ | 0.2398 | 10.0 | 3750 | 0.2954 | 0.8954 |
65
+ | 0.2143 | 11.0 | 4125 | 0.2724 | 0.9055 |
66
+ | 0.1808 | 12.0 | 4500 | 0.2843 | 0.8985 |
67
+ | 0.2298 | 13.0 | 4875 | 0.2918 | 0.8968 |
68
+ | 0.218 | 14.0 | 5250 | 0.2742 | 0.9036 |
69
+ | 0.1885 | 15.0 | 5625 | 0.2932 | 0.8976 |
70
+ | 0.1927 | 16.0 | 6000 | 0.2875 | 0.904 |
71
+ | 0.1546 | 17.0 | 6375 | 0.2832 | 0.9066 |
72
+ | 0.186 | 18.0 | 6750 | 0.2796 | 0.9054 |
73
+ | 0.1515 | 19.0 | 7125 | 0.2850 | 0.9018 |
74
+ | 0.1766 | 20.0 | 7500 | 0.2854 | 0.9055 |
75
 
76
 
77
  ### Framework versions