Commit 66d0064 (verified) by mtolgakbaba · 1 parent: d462c3a

distilbert-task-classifier

README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0877
+- Loss: 0.4168
 
 ## Model description
 
@@ -35,21 +35,23 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 4
-- eval_batch_size: 4
+- learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
+- num_epochs: 5
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.8617 | 1.0 | 200 | 0.7855 |
-| 0.6712 | 2.0 | 400 | 0.8970 |
-| 0.4594 | 3.0 | 600 | 1.0877 |
+| 0.8907 | 1.0 | 175 | 0.5684 |
+| 0.4774 | 2.0 | 350 | 0.3421 |
+| 0.2886 | 3.0 | 525 | 0.3073 |
+| 0.1644 | 4.0 | 700 | 0.3875 |
+| 0.0867 | 5.0 | 875 | 0.4168 |
 
 
 ### Framework versions
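
Review note on the new training-results table: validation loss bottoms out at epoch 3 and rises afterwards while training loss keeps falling, so the reported evaluation loss (0.4168) is the last epoch's value, not the lowest one. A minimal sketch of that reading, using only the numbers from the table; the variable names are illustrative and not part of the commit:

```python
# Rows copied from the new "Training results" table:
# (training_loss, epoch, step, validation_loss)
rows = [
    (0.8907, 1.0, 175, 0.5684),
    (0.4774, 2.0, 350, 0.3421),
    (0.2886, 3.0, 525, 0.3073),
    (0.1644, 4.0, 700, 0.3875),
    (0.0867, 5.0, 875, 0.4168),
]

# Epoch with the lowest validation loss, and the last epoch for comparison.
best = min(rows, key=lambda r: r[3])
final = rows[-1]

print(f"best epoch:  {best[1]:.0f} (validation loss {best[3]})")
print(f"final epoch: {final[1]:.0f} (validation loss {final[3]})")
print(f"gap: {final[3] - best[3]:.4f}")
```

If the checkpoint with the lowest validation loss is the one that should ship, the `load_best_model_at_end` option of `TrainingArguments` is one way to get it; whether this run used it is not visible in the diff.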
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ea9b1bac5da285c78ed4ee1f09a4df969a430f35df1f5ca842a9aa61855d4b85
+oid sha256:c748450fc5b7fcf534623fa98e650180b78376951a50d76d136d0eddd5b4d59b
 size 267835644
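
Review note on the LFS pointers: each one in this commit is three `key value` lines, giving the spec version, the `oid` (SHA-256 of the real file), and the `size` in bytes. A hedged sketch of checking downloaded bytes against a pointer; `parse_pointer` and `matches_pointer` are hypothetical helper names, not part of Git LFS or this repository:

```python
import hashlib

def parse_pointer(text: str) -> dict:
    """Split a Git LFS pointer into its key/value fields."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {"version": fields["version"], "algo": algo,
            "digest": digest, "size": int(fields["size"])}

def matches_pointer(data: bytes, pointer: dict) -> bool:
    """True if the bytes have the size and SHA-256 digest the pointer records."""
    return (len(data) == pointer["size"]
            and hashlib.sha256(data).hexdigest() == pointer["digest"])

# Pointer text copied from the new side of the model.safetensors diff.
pointer = parse_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:c748450fc5b7fcf534623fa98e650180b78376951a50d76d136d0eddd5b4d59b\n"
    "size 267835644\n"
)
print(pointer["algo"], pointer["size"])
```

The size is identical before and after, which is consistent with the same architecture being retrained; the `oid` is what distinguishes the two checkpoints.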
runs/Dec13_10-08-23_4dc25fb6cd72/events.out.tfevents.1734084512.4dc25fb6cd72.1275.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:361029b0e1fc45b976b31051a423172fcce4ca174a9b35370a6d9cba3f45e1d8
+size 7028
runs/Dec13_10-14-13_4dc25fb6cd72/events.out.tfevents.1734084855.4dc25fb6cd72.1275.1 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:59845c80bc9dbd581361ff328017d3c4a1f38eb804a90efa38beb8959b0b0246
+size 9301
runs/Dec13_10-15-38_4dc25fb6cd72/events.out.tfevents.1734084939.4dc25fb6cd72.1275.2 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ff5cf0af0df6d23a25fe513906d4edc81bb60336aea978251ba099bc9babe80a
+size 7825
runs/Dec13_10-22-23_4dc25fb6cd72/events.out.tfevents.1734085348.4dc25fb6cd72.1275.3 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:19514e0420b9b3bbeb28a510e6145891881b89eb674f6e574fc8d3ab707479ed
+size 7825
runs/Dec13_10-26-36_4dc25fb6cd72/events.out.tfevents.1734085598.4dc25fb6cd72.1275.4 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fed94fad4f232a96b6de130155a3b15e357c0e402d28f36ddeee43b101c18784
+size 7834
runs/Dec13_10-30-04_4dc25fb6cd72/events.out.tfevents.1734085806.4dc25fb6cd72.1275.5 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ed6f6be29f4c5def6059f502ba21ff397eabb6bee3a68829761c84ae6123383b
+size 6517
runs/Dec13_10-31-14_4dc25fb6cd72/events.out.tfevents.1734085874.4dc25fb6cd72.1275.6 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:eae54c8c77e6f8aef0e21324bff1403c163ac31d06f9bab3322dd4a52e91e3e1
+size 7835
runs/Dec13_10-40-56_4dc25fb6cd72/events.out.tfevents.1734086459.4dc25fb6cd72.1275.7 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:182ef2890cce061bb3e3c195989769fd753efce49eea69d6cdf1f6fecb4e23f3
+size 6517
runs/Dec13_10-42-27_4dc25fb6cd72/events.out.tfevents.1734086549.4dc25fb6cd72.1275.8 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:13a458ed86081d4087b4b448c1f379955dbd61a82e0859f3f4aca684a5e3b74b
+size 7826
runs/Dec13_10-49-16_4dc25fb6cd72/events.out.tfevents.1734086958.4dc25fb6cd72.1275.9 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:41d2d4a68191759bccd1152bcf2be808089a498b29f0710cf77a195d1982895b
+size 7835
tokenizer.json CHANGED
@@ -6,7 +6,14 @@
     "strategy": "LongestFirst",
     "stride": 0
   },
-  "padding": null,
+  "padding": {
+    "strategy": "BatchLongest",
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 0,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
   "added_tokens": [
     {
       "id": 0,
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:36579bc743b4aced9f8eed665beb8e76bb486dc015eca4a470da2b09e7ae93df
+oid sha256:e1a75d20759f71a5e77492b25fd9fb9cbce981e65e316000236e57eaae1893f9
 size 5240
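
Review note on the tokenizer.json change in this commit: the new `padding` block (`BatchLongest`, `Right`, `pad_id` 0, `[PAD]`) means every sequence in a batch is padded on the right, with token id 0, to the length of the longest sequence in that batch. A plain-Python sketch of that behaviour, not the `tokenizers` implementation; `pad_batch` and the example token ids are illustrative assumptions:

```python
def pad_batch(batch, pad_id=0):
    """Right-pad each list of token ids to the longest length in the batch."""
    longest = max(len(ids) for ids in batch)
    padded = [ids + [pad_id] * (longest - len(ids)) for ids in batch]
    # The attention mask marks real tokens with 1 and padding with 0.
    mask = [[1] * len(ids) + [0] * (longest - len(ids)) for ids in batch]
    return padded, mask

# Two made-up sequences of different lengths.
padded, mask = pad_batch([[101, 7592, 102], [101, 7592, 2088, 999, 102]])
print(padded)
print(mask)
```

With `"padding": null`, as on the old side of the diff, no padding configuration is stored in the file; the diff does not say how the new block was set.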