distilbert-task-classifier

Files changed (14)

README.md CHANGED

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0877
 ## Model description
@@ -35,21 +35,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.8617        | 1.0   | 200  | 0.7855          |
-| 0.6712        | 2.0   | 400  | 0.8970          |
-| 0.4594        | 3.0   | 600  | 1.0877          |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4168
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.8907        | 1.0   | 175  | 0.5684          |
+| 0.4774        | 2.0   | 350  | 0.3421          |
+| 0.2886        | 3.0   | 525  | 0.3073          |
+| 0.1644        | 4.0   | 700  | 0.3875          |
+| 0.0867        | 5.0   | 875  | 0.4168          |
 ### Framework versions
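
To read the added results table at a glance, the sketch below picks the epoch with the lowest validation loss and estimates an upper bound on the training-set size from the step count. The rows are copied from the added lines; the size estimate assumes a single device and no gradient accumulation, neither of which the diff states.

```python
# (epoch, step, training_loss, validation_loss) copied from the added table rows
rows = [
    (1, 175, 0.8907, 0.5684),
    (2, 350, 0.4774, 0.3421),
    (3, 525, 0.2886, 0.3073),
    (4, 700, 0.1644, 0.3875),
    (5, 875, 0.0867, 0.4168),
]
TRAIN_BATCH_SIZE = 8  # from the added hyperparameters

# Epoch with the lowest validation loss
best = min(rows, key=lambda r: r[3])
best_epoch, best_val_loss = best[0], best[3]

# Steps in the first epoch times batch size bounds the number of training
# examples (assumption: one device, no gradient accumulation; the last batch
# may be partial, so the true count can be up to 7 lower).
steps_per_epoch = rows[0][1]
max_train_examples = steps_per_epoch * TRAIN_BATCH_SIZE

final_val_loss = rows[-1][3]
print(f"best epoch: {best_epoch} (val loss {best_val_loss})")
print(f"final val loss: {final_val_loss}")
print(f"at most {max_train_examples} training examples")
```

Validation loss bottoms out at epoch 3 and rises afterwards while training loss keeps falling, so the reported `Loss: 0.4168` is the last epoch's value rather than the best one.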

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ea9b1bac5da285c78ed4ee1f09a4df969a430f35df1f5ca842a9aa61855d4b85
 size 267835644

 version https://git-lfs.github.com/spec/v1
+oid sha256:c748450fc5b7fcf534623fa98e650180b78376951a50d76d136d0eddd5b4d59b
 size 267835644
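
The weights file is stored as a Git LFS pointer, so the diff shows only the pointer's three fields. A minimal sketch of comparing the two pointers is given below; `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any library.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is a byte count
    return fields


OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:ea9b1bac5da285c78ed4ee1f09a4df969a430f35df1f5ca842a9aa61855d4b85
size 267835644
"""

NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:c748450fc5b7fcf534623fa98e650180b78376951a50d76d136d0eddd5b4d59b
size 267835644
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)

content_changed = old["oid"] != new["oid"]
size_changed = old["size"] != new["size"]
print(f"content changed: {content_changed}, size changed: {size_changed}")
```

The hash differs while the byte size stays the same, which is what one would expect when the same architecture is retrained with new weights.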

runs/Dec13_10-08-23_4dc25fb6cd72/events.out.tfevents.1734084512.4dc25fb6cd72.1275.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:361029b0e1fc45b976b31051a423172fcce4ca174a9b35370a6d9cba3f45e1d8
+size 7028

runs/Dec13_10-14-13_4dc25fb6cd72/events.out.tfevents.1734084855.4dc25fb6cd72.1275.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:59845c80bc9dbd581361ff328017d3c4a1f38eb804a90efa38beb8959b0b0246
+size 9301

runs/Dec13_10-15-38_4dc25fb6cd72/events.out.tfevents.1734084939.4dc25fb6cd72.1275.2 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:ff5cf0af0df6d23a25fe513906d4edc81bb60336aea978251ba099bc9babe80a
+size 7825

runs/Dec13_10-22-23_4dc25fb6cd72/events.out.tfevents.1734085348.4dc25fb6cd72.1275.3 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:19514e0420b9b3bbeb28a510e6145891881b89eb674f6e574fc8d3ab707479ed
+size 7825

runs/Dec13_10-26-36_4dc25fb6cd72/events.out.tfevents.1734085598.4dc25fb6cd72.1275.4 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:fed94fad4f232a96b6de130155a3b15e357c0e402d28f36ddeee43b101c18784
+size 7834

runs/Dec13_10-30-04_4dc25fb6cd72/events.out.tfevents.1734085806.4dc25fb6cd72.1275.5 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:ed6f6be29f4c5def6059f502ba21ff397eabb6bee3a68829761c84ae6123383b
+size 6517

runs/Dec13_10-31-14_4dc25fb6cd72/events.out.tfevents.1734085874.4dc25fb6cd72.1275.6 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:eae54c8c77e6f8aef0e21324bff1403c163ac31d06f9bab3322dd4a52e91e3e1
+size 7835

runs/Dec13_10-40-56_4dc25fb6cd72/events.out.tfevents.1734086459.4dc25fb6cd72.1275.7 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:182ef2890cce061bb3e3c195989769fd753efce49eea69d6cdf1f6fecb4e23f3
+size 6517

runs/Dec13_10-42-27_4dc25fb6cd72/events.out.tfevents.1734086549.4dc25fb6cd72.1275.8 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:13a458ed86081d4087b4b448c1f379955dbd61a82e0859f3f4aca684a5e3b74b
+size 7826

runs/Dec13_10-49-16_4dc25fb6cd72/events.out.tfevents.1734086958.4dc25fb6cd72.1275.9 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:41d2d4a68191759bccd1152bcf2be808089a498b29f0710cf77a195d1982895b
+size 7835
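
The ten added event files encode when and where each run was started. The sketch below splits one of the file names into its parts, assuming the common TensorBoard-style layout `events.out.tfevents.<unix time>.<hostname>.<pid>.<index>`; that layout is an assumption here, and `parse_event_filename` is a hypothetical helper.

```python
from datetime import datetime, timezone


def parse_event_filename(path: str) -> dict:
    """Split a TensorBoard-style event file name into its components.

    Assumed layout: events.out.tfevents.<unix time>.<hostname>.<pid>.<index>
    """
    name = path.rsplit("/", 1)[-1]
    prefix = "events.out.tfevents."
    if not name.startswith(prefix):
        raise ValueError(f"not an event file name: {name}")
    timestamp, rest = name[len(prefix):].split(".", 1)
    host, pid, index = rest.rsplit(".", 2)
    return {
        "started_utc": datetime.fromtimestamp(int(timestamp), tz=timezone.utc),
        "host": host,
        "pid": int(pid),
        "index": int(index),
    }


first = parse_event_filename(
    "runs/Dec13_10-08-23_4dc25fb6cd72/"
    "events.out.tfevents.1734084512.4dc25fb6cd72.1275.0"
)
last = parse_event_filename(
    "runs/Dec13_10-49-16_4dc25fb6cd72/"
    "events.out.tfevents.1734086958.4dc25fb6cd72.1275.9"
)
elapsed_seconds = (last["started_utc"] - first["started_utc"]).total_seconds()
print(first["started_utc"].isoformat(), elapsed_seconds)
```

Read this way, all ten runs share one host and one process ID, and they were started within about 41 minutes of each other.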

tokenizer.json CHANGED

@@ -6,7 +6,14 @@
     "strategy": "LongestFirst",
     "stride": 0
   },
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

     "strategy": "LongestFirst",
     "stride": 0
   },
+  "padding": {
+    "strategy": "BatchLongest",
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 0,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
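
The added `padding` block switches the saved tokenizer from no padding to padding every sequence in a batch to the length of the longest one, on the right, with ID 0. The plain-Python sketch below illustrates that behaviour only; `pad_batch` is a hypothetical helper, not the tokenizer library's implementation, and the token IDs are arbitrary illustrative values.

```python
def pad_batch(batch, pad_id=0, direction="Right"):
    """Pad every sequence to the longest one in the batch ("BatchLongest").

    Returns the padded ID lists and matching attention masks
    (1 for real tokens, 0 for padding).
    """
    longest = max((len(ids) for ids in batch), default=0)
    padded, masks = [], []
    for ids in batch:
        fill = [pad_id] * (longest - len(ids))
        mask_fill = [0] * len(fill)
        if direction == "Right":
            padded.append(list(ids) + fill)
            masks.append([1] * len(ids) + mask_fill)
        else:  # "Left"
            padded.append(fill + list(ids))
            masks.append(mask_fill + [1] * len(ids))
    return padded, masks


# Two sequences of different lengths; the IDs are illustrative only
batch = [[101, 7592, 102], [101, 7592, 2088, 999, 102]]
padded, masks = pad_batch(batch, pad_id=0, direction="Right")
print(padded)
print(masks)
```

Because the setting is persisted in `tokenizer.json`, anything that loads this file may pad batches by default, which matters to callers that previously relied on unpadded output.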
   "added_tokens": [
     {
       "id": 0,

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:36579bc743b4aced9f8eed665beb8e76bb486dc015eca4a470da2b09e7ae93df
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:e1a75d20759f71a5e77492b25fd9fb9cbce981e65e316000236e57eaae1893f9
 size 5240