sento800/distilbert-base-cased-squad

Question Answering

Generated from Trainer

Inference Endpoints

sento800 committed on Sep 9, 2023

Commit e1fcf4c · 1 Parent(s): 73a5631

End of training

Files changed (3)

README.md +11 -11
pytorch_model.bin +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the squad dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3039
 ## Model description
@@ -37,8 +37,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 10
-- eval_batch_size: 10
 - seed: 0
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -46,14 +46,14 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 2.3688        | 1.0   | 1350 | 1.4871          |
-| 1.2342        | 2.0   | 2700 | 1.3039          |
-| 0.8185        | 3.0   | 4050 | 1.3300          |
-| 0.5646        | 4.0   | 5400 | 1.5053          |
-| 0.4032        | 5.0   | 6750 | 1.6240          |
-| 0.314         | 6.0   | 8100 | 1.7492          |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the squad dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.4144
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 0
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 0.8764        | 1.0   | 1688  | 1.4144          |
+| 0.5653        | 2.0   | 3376  | 1.4274          |
+| 0.3689        | 3.0   | 5064  | 1.7517          |
+| 0.2427        | 4.0   | 6752  | 2.0817          |
+| 0.1613        | 5.0   | 8440  | 2.3114          |
+| 0.1159        | 6.0   | 10128 | 2.5096          |
 ### Framework versions

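As a reading aid for the two "Training results" tables in the README diff, the sketch below picks the epoch with the lowest validation loss in each run and checks the step counts against the batch sizes. The validation losses are copied from the tables. The training-set size `ASSUMED_NUM_EXAMPLES` is an assumption and is not stated anywhere in the README: 13500 is simply a value consistent with both step counts, provided training ran on a single device without gradient accumulation.

```python
import math

# (epoch, validation loss) pairs copied from the removed and added tables.
OLD_RUN = [(1, 1.4871), (2, 1.3039), (3, 1.3300), (4, 1.5053), (5, 1.6240), (6, 1.7492)]
NEW_RUN = [(1, 1.4144), (2, 1.4274), (3, 1.7517), (4, 2.0817), (5, 2.3114), (6, 2.5096)]

# Assumption: not given in the README; chosen because it reproduces both step counts.
ASSUMED_NUM_EXAMPLES = 13500


def best_epoch(history):
    """Return the (epoch, validation_loss) pair with the lowest validation loss."""
    return min(history, key=lambda row: row[1])


def steps_per_epoch(num_examples, batch_size):
    """Optimizer steps per epoch, assuming one device and no gradient accumulation."""
    return math.ceil(num_examples / batch_size)


if __name__ == "__main__":
    print("old run best:", best_epoch(OLD_RUN))
    print("new run best:", best_epoch(NEW_RUN))
    print("steps/epoch at batch 10:", steps_per_epoch(ASSUMED_NUM_EXAMPLES, 10))
    print("steps/epoch at batch 8:", steps_per_epoch(ASSUMED_NUM_EXAMPLES, 8))
```

In both runs the lowest validation loss equals the "Loss" value reported at the top of the README (1.3039 before, 1.4144 after). Validation loss rises after that epoch while training loss keeps falling, which is the usual signature of overfitting.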
pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:30216fcef907cc24138f696c613094ccc5372bc7cdfd3bab8044c8f5c38d8f2a
 size 260804645

 version https://git-lfs.github.com/spec/v1
+oid sha256:544761ce051ec8b0f0547d0acd8920ca14f70a88dd64c88e0d40107f415e98c9
 size 260804645

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:af7b40f158b6d030a2d700d330bd149279208cf495c432ef44de4b5a809ea808
 size 4027

 version https://git-lfs.github.com/spec/v1
+oid sha256:42a32534d20529529e0e92904bd91412ef23591255c36bfc1e82d328f7d246ab
 size 4027