sakren commited on
Commit
92f0e16
1 Parent(s): ef5275e

sakren/debert-imeocap

Browse files
README.md ADDED
@@ -0,0 +1,80 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ base_model: microsoft/deberta-v3-base
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - f1
8
+ - precision
9
+ - recall
10
+ - accuracy
11
+ model-index:
12
+ - name: debert-imeocap
13
+ results: []
14
+ ---
15
+
16
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
17
+ should probably proofread and complete it, then remove this comment. -->
18
+
19
+ # debert-imeocap
20
+
21
+ This model is a fine-tuned version of [microsoft/deberta-v3-base](https://huggingface.co/microsoft/deberta-v3-base) on the None dataset.
22
+ It achieves the following results on the evaluation set:
23
+ - Loss: 1.3914
24
+ - F1: 0.6372
25
+ - Precision: 0.6448
26
+ - Recall: 0.6365
27
+ - Accuracy: 0.6365
28
+
29
+ ## Model description
30
+
31
+ More information needed
32
+
33
+ ## Intended uses & limitations
34
+
35
+ More information needed
36
+
37
+ ## Training and evaluation data
38
+
39
+ More information needed
40
+
41
+ ## Training procedure
42
+
43
+ ### Training hyperparameters
44
+
45
+ The following hyperparameters were used during training:
46
+ - learning_rate: 2e-05
47
+ - train_batch_size: 64
48
+ - eval_batch_size: 64
49
+ - seed: 42
50
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
51
+ - lr_scheduler_type: linear
52
+ - num_epochs: 15
53
+
54
+ ### Training results
55
+
56
+ | Training Loss | Epoch | Step | Validation Loss | F1 | Precision | Recall | Accuracy |
57
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:---------:|:------:|:--------:|
58
+ | 1.5405 | 1.0 | 74 | 1.4488 | 0.3206 | 0.2386 | 0.4885 | 0.4885 |
59
+ | 1.3156 | 2.0 | 148 | 1.1964 | 0.5541 | 0.5627 | 0.575 | 0.575 |
60
+ | 1.0728 | 3.0 | 222 | 1.1077 | 0.6001 | 0.6189 | 0.5981 | 0.5981 |
61
+ | 0.9239 | 4.0 | 296 | 1.0742 | 0.6324 | 0.6361 | 0.6365 | 0.6365 |
62
+ | 0.7802 | 5.0 | 370 | 1.0834 | 0.6073 | 0.6333 | 0.6058 | 0.6058 |
63
+ | 0.661 | 6.0 | 444 | 1.1733 | 0.5984 | 0.6166 | 0.5962 | 0.5962 |
64
+ | 0.602 | 7.0 | 518 | 1.1786 | 0.5911 | 0.6193 | 0.5885 | 0.5885 |
65
+ | 0.5391 | 8.0 | 592 | 1.2171 | 0.6156 | 0.6251 | 0.6154 | 0.6154 |
66
+ | 0.4815 | 9.0 | 666 | 1.2566 | 0.6259 | 0.6399 | 0.625 | 0.625 |
67
+ | 0.4548 | 10.0 | 740 | 1.2927 | 0.6233 | 0.6417 | 0.6212 | 0.6212 |
68
+ | 0.4538 | 11.0 | 814 | 1.2969 | 0.6385 | 0.6461 | 0.6385 | 0.6385 |
69
+ | 0.4119 | 12.0 | 888 | 1.3455 | 0.6376 | 0.6464 | 0.6365 | 0.6365 |
70
+ | 0.3968 | 13.0 | 962 | 1.3709 | 0.6304 | 0.6413 | 0.6288 | 0.6288 |
71
+ | 0.352 | 14.0 | 1036 | 1.3823 | 0.6246 | 0.6360 | 0.6231 | 0.6231 |
72
+ | 0.3551 | 15.0 | 1110 | 1.3914 | 0.6372 | 0.6448 | 0.6365 | 0.6365 |
73
+
74
+
75
+ ### Framework versions
76
+
77
+ - Transformers 4.39.3
78
+ - Pytorch 2.1.2
79
+ - Datasets 2.18.0
80
+ - Tokenizers 0.15.2
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:89a2de6caf761ebec1b2008179ccad8d78903208460fdcee2925b6688214192d
3
  size 737731584
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:75c1a12d049fc58bb39bf5d40f8feca989a9014974d00b8ff0b97a58ce857420
3
  size 737731584
runs/May15_17-29-43_95a092faa389/events.out.tfevents.1715794255.95a092faa389.35.7 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5f338dd8e97517c5e7f238638788469d40b14c867b96b66513a0da095c9705a9
3
- size 13878
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48ecbbcbcdea54e916e92bc53b450e636e45581e7ccbb95f967053e56ffbd717
3
+ size 15598
runs/May15_17-29-43_95a092faa389/events.out.tfevents.1715795468.95a092faa389.35.8 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b135443e1f76934b756fb157e1ca195171fb3823baefc3678515330b63e28799
3
+ size 1032