winegarj
/

distilbert-base-uncased-finetuned-sst2

Text Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

winegarj commited on Aug 13, 2022

Commit

1f0712a

•

1 Parent(s): 85cbf31

update model card README.md

Files changed (1) hide show

README.md +18 -16

README.md CHANGED Viewed

@@ -15,11 +15,13 @@ model-index:
     dataset:
       name: glue
       type: glue
       args: sst2
     metrics:
     - name: Accuracy
       type: accuracy
-      value: 0.908256880733945
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,8 +31,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the glue dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4493
-- Accuracy: 0.9083
 ## Model description
@@ -50,8 +52,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -59,18 +61,18 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Accuracy |
-|:-------------:|:-----:|:-----:|:---------------:|:--------:|
-| 0.1804        | 1.0   | 2105  | 0.2843          | 0.9025   |
-| 0.1216        | 2.0   | 4210  | 0.3242          | 0.9025   |
-| 0.0871        | 3.0   | 6315  | 0.3320          | 0.9060   |
-| 0.0607        | 4.0   | 8420  | 0.3913          | 0.9025   |
-| 0.0429        | 5.0   | 10525 | 0.4493          | 0.9083   |
 ### Framework versions
-- Transformers 4.18.0
-- Pytorch 1.12.0.dev20220409+cu115
-- Datasets 2.0.0
-- Tokenizers 0.12.0

     dataset:
       name: glue
       type: glue
+      config: sst2
+      split: train
       args: sst2
     metrics:
     - name: Accuracy
       type: accuracy
+      value: 0.9025229357798165
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the glue dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2823
+- Accuracy: 0.9025
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 512
+- eval_batch_size: 512
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:--------:|
+| No log        | 1.0   | 132  | 0.2528          | 0.8933   |
+| No log        | 2.0   | 264  | 0.2675          | 0.8979   |
+| No log        | 3.0   | 396  | 0.2823          | 0.9025   |
+| 0.1898        | 4.0   | 528  | 0.2986          | 0.8968   |
+| 0.1898        | 5.0   | 660  | 0.3029          | 0.9002   |
 ### Framework versions
+- Transformers 4.21.1
+- Pytorch 1.12.1+cu116
+- Datasets 2.4.0
+- Tokenizers 0.12.1