finsynth
/

deberta-v3-base-financial-inc-dec-ner

@@ -22,10 +22,10 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.0416
-- Precision: 0.9291
 - Recall: 0.9704
-- F1: 0.9493
-- Accuracy: 0.9910
 ## Model description
@@ -44,24 +44,34 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 6
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| No log        | 1.0   | 184  | 0.0454          | 0.9154    | 0.8815 | 0.8981 | 0.9843   |
-| No log        | 2.0   | 368  | 0.0444          | 0.9220    | 0.9630 | 0.9420 | 0.9903   |
-| 0.0654        | 3.0   | 552  | 0.0416          | 0.9291    | 0.9704 | 0.9493 | 0.9910   |
-| 0.0654        | 4.0   | 736  | 0.0422          | 0.9489    | 0.9630 | 0.9559 | 0.9918   |
-| 0.0654        | 5.0   | 920  | 0.0451          | 0.9416    | 0.9556 | 0.9485 | 0.9910   |
-| 0.0064        | 6.0   | 1104 | 0.0461          | 0.9416    | 0.9556 | 0.9485 | 0.9910   |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
 - Loss: 0.0416
+- Precision: 0.9632
 - Recall: 0.9704
+- F1: 0.9668
+- Accuracy: 0.9933
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 100
+- num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| No log        | 1.0   | 92   | 0.1193          | 0.625     | 0.7407 | 0.6780 | 0.9588   |
+| No log        | 2.0   | 184  | 0.0522          | 0.8643    | 0.8963 | 0.88   | 0.9798   |
+| No log        | 3.0   | 276  | 0.0554          | 0.8897    | 0.8963 | 0.8930 | 0.9835   |
+| No log        | 4.0   | 368  | 0.0362          | 0.9416    | 0.9556 | 0.9485 | 0.9910   |
+| No log        | 5.0   | 460  | 0.0315          | 0.9286    | 0.9630 | 0.9455 | 0.9918   |
+| 0.1731        | 6.0   | 552  | 0.0416          | 0.9632    | 0.9704 | 0.9668 | 0.9933   |
+| 0.1731        | 7.0   | 644  | 0.0496          | 0.9420    | 0.9630 | 0.9524 | 0.9910   |
+| 0.1731        | 8.0   | 736  | 0.0527          | 0.9420    | 0.9630 | 0.9524 | 0.9910   |
+| 0.1731        | 9.0   | 828  | 0.0604          | 0.9348    | 0.9556 | 0.9451 | 0.9895   |
+| 0.1731        | 10.0  | 920  | 0.0564          | 0.9420    | 0.9630 | 0.9524 | 0.9910   |
+| 0.0028        | 11.0  | 1012 | 0.0571          | 0.9493    | 0.9704 | 0.9597 | 0.9918   |
+| 0.0028        | 12.0  | 1104 | 0.0570          | 0.9493    | 0.9704 | 0.9597 | 0.9918   |
+| 0.0028        | 13.0  | 1196 | 0.0559          | 0.9493    | 0.9704 | 0.9597 | 0.9918   |
+| 0.0028        | 14.0  | 1288 | 0.0574          | 0.9493    | 0.9704 | 0.9597 | 0.9918   |
+| 0.0028        | 15.0  | 1380 | 0.0576          | 0.9493    | 0.9704 | 0.9597 | 0.9918   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:be5330642bc16ac596bf54d99684ef8047ee8e364aa30203e7788a4029e67a3b
 size 735359804

 version https://git-lfs.github.com/spec/v1
+oid sha256:c37998b35c5c917e035991f5b0c72727691be0df907956da6df5b0382a2c7db1
 size 735359804