dennishauser commited on
Commit
c71698b
1 Parent(s): 234df19

End of training

Browse files
README.md ADDED
@@ -0,0 +1,68 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - generated_from_trainer
4
+ metrics:
5
+ - accuracy
6
+ model-index:
7
+ - name: ViT_MNIST
8
+ results: []
9
+ ---
10
+
11
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
+ should probably proofread and complete it, then remove this comment. -->
13
+
14
+ # ViT_MNIST
15
+
16
+ This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
17
+ It achieves the following results on the evaluation set:
18
+ - Loss: 0.2367
19
+ - Accuracy: 0.9379
20
+
21
+ ## Model description
22
+
23
+ More information needed
24
+
25
+ ## Intended uses & limitations
26
+
27
+ More information needed
28
+
29
+ ## Training and evaluation data
30
+
31
+ More information needed
32
+
33
+ ## Training procedure
34
+
35
+ ### Training hyperparameters
36
+
37
+ The following hyperparameters were used during training:
38
+ - learning_rate: 5e-05
39
+ - train_batch_size: 5500
40
+ - eval_batch_size: 5500
41
+ - seed: 42
42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
+ - lr_scheduler_type: linear
44
+ - lr_scheduler_warmup_ratio: 0.1
45
+ - num_epochs: 10
46
+
47
+ ### Training results
48
+
49
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
50
+ |:-------------:|:-----:|:----:|:---------------:|:--------:|
51
+ | 0.3795 | 1.0 | 11 | 0.3894 | 0.897 |
52
+ | 0.3668 | 2.0 | 22 | 0.3547 | 0.9059 |
53
+ | 0.3441 | 3.0 | 33 | 0.3186 | 0.9174 |
54
+ | 0.3163 | 4.0 | 44 | 0.2998 | 0.9235 |
55
+ | 0.299 | 5.0 | 55 | 0.2860 | 0.9259 |
56
+ | 0.2788 | 6.0 | 66 | 0.2770 | 0.9291 |
57
+ | 0.2684 | 7.0 | 77 | 0.2553 | 0.9342 |
58
+ | 0.2579 | 8.0 | 88 | 0.2545 | 0.9338 |
59
+ | 0.2449 | 9.0 | 99 | 0.2403 | 0.9378 |
60
+ | 0.2322 | 10.0 | 110 | 0.2367 | 0.9379 |
61
+
62
+
63
+ ### Framework versions
64
+
65
+ - Transformers 4.42.4
66
+ - Pytorch 2.4.0+cu121
67
+ - Datasets 2.21.0
68
+ - Tokenizers 0.19.1
config.json ADDED
@@ -0,0 +1,47 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "ViTForImageClassification"
4
+ ],
5
+ "attention_probs_dropout_prob": 0.0,
6
+ "encoder_stride": 16,
7
+ "hidden_act": "gelu",
8
+ "hidden_dropout_prob": 0.0,
9
+ "hidden_size": 256,
10
+ "id2label": {
11
+ "0": "0",
12
+ "1": "1",
13
+ "2": "2",
14
+ "3": "3",
15
+ "4": "4",
16
+ "5": "5",
17
+ "6": "6",
18
+ "7": "7",
19
+ "8": "8",
20
+ "9": "9"
21
+ },
22
+ "image_size": 28,
23
+ "initializer_range": 0.02,
24
+ "intermediate_size": 3072,
25
+ "label2id": {
26
+ "0": "0",
27
+ "1": "1",
28
+ "2": "2",
29
+ "3": "3",
30
+ "4": "4",
31
+ "5": "5",
32
+ "6": "6",
33
+ "7": "7",
34
+ "8": "8",
35
+ "9": "9"
36
+ },
37
+ "layer_norm_eps": 1e-12,
38
+ "model_type": "vit",
39
+ "num_attention_heads": 8,
40
+ "num_channels": 1,
41
+ "num_hidden_layers": 3,
42
+ "patch_size": 7,
43
+ "problem_type": "single_label_classification",
44
+ "qkv_bias": true,
45
+ "torch_dtype": "float32",
46
+ "transformers_version": "4.42.4"
47
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:369d34987db52b93b31fa73900e9ccad61073de2a34f19e85d41f1a2456bd2b6
3
+ size 22172808
preprocessor_config.json ADDED
@@ -0,0 +1,22 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "do_normalize": true,
3
+ "do_rescale": true,
4
+ "do_resize": true,
5
+ "image_mean": [
6
+ 0.5,
7
+ 0.5,
8
+ 0.5
9
+ ],
10
+ "image_processor_type": "ViTImageProcessor",
11
+ "image_std": [
12
+ 0.5,
13
+ 0.5,
14
+ 0.5
15
+ ],
16
+ "resample": 2,
17
+ "rescale_factor": 0.00392156862745098,
18
+ "size": {
19
+ "height": 28,
20
+ "width": 28
21
+ }
22
+ }
runs/Sep01_14-24-19_03a1b1bb25ab/events.out.tfevents.1725200665.03a1b1bb25ab.551.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:eced0aa46b7e0f7f305158351c0e8e64cb1d3644a9574ff01da192f6c03f4cfb
3
+ size 346
runs/Sep01_14-33-24_03a1b1bb25ab/events.out.tfevents.1725201216.03a1b1bb25ab.551.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:522e489612028d9303cf8e61dcd73bf606a5da7e92e345b2a841f771ae204007
3
+ size 346
runs/Sep01_17-41-48_03a1b1bb25ab/events.out.tfevents.1725214638.03a1b1bb25ab.551.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7902d71732a7fd85e3c2f4ac3b9f85e7cb477ab003c028f40bf06381c78450ae
3
+ size 7898
runs/Sep01_17-41-48_03a1b1bb25ab/events.out.tfevents.1725214779.03a1b1bb25ab.551.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:10d6b08e42385e1f1be5f9e33f2e2264fdf201c1df2018806ae0d6b222e67cf6
3
+ size 405
runs/Sep01_18-27-44_03a1b1bb25ab/events.out.tfevents.1725215266.03a1b1bb25ab.551.4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e17c7b110cad6f2f63be1d0dacd875a6626f94c3e58ab84e7bc0f457f73ae6e9
3
+ size 10861
runs/Sep01_18-27-44_03a1b1bb25ab/events.out.tfevents.1725215575.03a1b1bb25ab.551.5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ac60ef6798590295fd0aec8cba635c81efa4cf72ef8dbea96d7d61837a1c31a2
3
+ size 405
runs/Sep01_18-52-02_03a1b1bb25ab/events.out.tfevents.1725216754.03a1b1bb25ab.551.6 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:20cb460dd5b386509a9b90006ccff265d34531eff95cfd9482aacd806ab03a3f
3
+ size 10861
runs/Sep01_18-52-02_03a1b1bb25ab/events.out.tfevents.1725216990.03a1b1bb25ab.551.7 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4926eb3793e425cf1b92b59ade8c03452a2ab5f77dea80655f801920d55b6f98
3
+ size 405
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:16069f3528c9f630eb292a82039658e9c6426ed52daef602e7ace49a6e262adc
3
+ size 5112