New version with explicit predicate marking

Browse files

Files changed (4) hide show

README.md +37 -40
config.json +42 -44
pytorch_model.bin +2 -2
training_args.bin +2 -2

README.md CHANGED Viewed

@@ -15,42 +15,42 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [cointegrated/rubert-tiny2](https://huggingface.co/cointegrated/rubert-tiny2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1559
-- Addressee Precision: 0.7778
 - Addressee Recall: 0.875
-- Addressee F1: 0.8235
 - Addressee Number: 8
 - Benefactive Precision: 0.0
 - Benefactive Recall: 0.0
 - Benefactive F1: 0.0
 - Benefactive Number: 2
-- Causator Precision: 0.8667
 - Causator Recall: 0.8125
-- Causator F1: 0.8387
 - Causator Number: 16
-- Cause Precision: 0.5
-- Cause Recall: 0.1667
-- Cause F1: 0.25
 - Cause Number: 12
-- Contrsubject Precision: 0.7333
-- Contrsubject Recall: 0.6471
-- Contrsubject F1: 0.6875
 - Contrsubject Number: 17
-- Deliberative Precision: 0.8333
-- Deliberative Recall: 0.8333
-- Deliberative F1: 0.8333
 - Deliberative Number: 6
 - Destinative Precision: 1.0
-- Destinative Recall: 0.75
-- Destinative F1: 0.8571
 - Destinative Number: 4
 - Directivefinal Precision: 1.0
 - Directivefinal Recall: 1.0
 - Directivefinal F1: 1.0
 - Directivefinal Number: 2
-- Experiencer Precision: 0.7870
-- Experiencer Recall: 0.8947
-- Experiencer F1: 0.8374
 - Experiencer Number: 95
 - Instrument Precision: 0.0
 - Instrument Recall: 0.0
@@ -60,18 +60,14 @@ It achieves the following results on the evaluation set:
 - Limitative Recall: 0.0
 - Limitative F1: 0.0
 - Limitative Number: 1
-- Object Precision: 0.7714
-- Object Recall: 0.7875
-- Object F1: 0.7794
 - Object Number: 240
-- Predicate Precision: 0.9928
-- Predicate Recall: 1.0
-- Predicate F1: 0.9964
-- Predicate Number: 274
-- Overall Precision: 0.8653
-- Overall Recall: 0.8691
-- Overall F1: 0.8672
-- Overall Accuracy: 0.9566
 ## Model description
@@ -90,24 +86,25 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0001722734324185955
-- train_batch_size: 8
 - eval_batch_size: 1
-- seed: 815951
 - gradient_accumulation_steps: 4
-- total_train_batch_size: 32
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.04
-- num_epochs: 3
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Addressee Precision | Addressee Recall | Addressee F1 | Addressee Number | Benefactive Precision | Benefactive Recall | Benefactive F1 | Benefactive Number | Causator Precision | Causator Recall | Causator F1 | Causator Number | Cause Precision | Cause Recall | Cause F1 | Cause Number | Contrsubject Precision | Contrsubject Recall | Contrsubject F1 | Contrsubject Number | Deliberative Precision | Deliberative Recall | Deliberative F1 | Deliberative Number | Destinative Precision | Destinative Recall | Destinative F1 | Destinative Number | Directivefinal Precision | Directivefinal Recall | Directivefinal F1 | Directivefinal Number | Experiencer Precision | Experiencer Recall | Experiencer F1 | Experiencer Number | Instrument Precision | Instrument Recall | Instrument F1 | Instrument Number | Limitative Precision | Limitative Recall | Limitative F1 | Limitative Number | Object Precision | Object Recall | Object F1 | Object Number | Predicate Precision | Predicate Recall | Predicate F1 | Predicate Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:-------------------:|:----------------:|:------------:|:----------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------:|:---------------:|:-----------:|:---------------:|:---------------:|:------------:|:--------:|:------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------------:|:---------------------:|:-----------------:|:---------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:----------------:|:-------------:|:---------:|:-------------:|:-------------------:|:----------------:|:------------:|:----------------:|:-----------------:|:--------------:|:----------:|:----------------:|
-| 0.2353        | 1.0   | 245  | 0.1957          | 0.6                 | 0.375            | 0.4615       | 8                | 0.0                   | 0.0                | 0.0            | 2                  | 0.7368             | 0.875           | 0.8000      | 16              | 0.2857          | 0.1667       | 0.2105   | 12           | 0.6667                 | 0.2353              | 0.3478          | 17                  | 0.0                    | 0.0                 | 0.0             | 6                   | 0.0                   | 0.0                | 0.0            | 4                  | 0.0                      | 0.0                   | 0.0               | 2                     | 0.7477                | 0.8421             | 0.7921         | 95                 | 0.0                  | 0.0               | 0.0           | 3                 | 0.0                  | 0.0               | 0.0           | 1                 | 0.7696           | 0.6958        | 0.7309    | 240           | 0.9783              | 0.9854           | 0.9818       | 274              | 0.8477            | 0.7941         | 0.8200     | 0.9466           |
-| 0.148         | 2.0   | 491  | 0.1635          | 0.7                 | 0.875            | 0.7778       | 8                | 0.0                   | 0.0                | 0.0            | 2                  | 0.8333             | 0.9375          | 0.8824      | 16              | 0.4             | 0.1667       | 0.2353   | 12           | 0.6923                 | 0.5294              | 0.6000          | 17                  | 0.8                    | 0.6667              | 0.7273          | 6                   | 1.0                   | 0.25               | 0.4            | 4                  | 1.0                      | 1.0                   | 1.0               | 2                     | 0.7436                | 0.9158             | 0.8208         | 95                 | 0.0                  | 0.0               | 0.0           | 3                 | 0.0                  | 0.0               | 0.0           | 1                 | 0.7864           | 0.7208        | 0.7522    | 240           | 0.9892              | 1.0              | 0.9946       | 274              | 0.8593            | 0.8441         | 0.8516     | 0.9538           |
-| 0.1046        | 2.99  | 735  | 0.1559          | 0.7778              | 0.875            | 0.8235       | 8                | 0.0                   | 0.0                | 0.0            | 2                  | 0.8667             | 0.8125          | 0.8387      | 16              | 0.5             | 0.1667       | 0.25     | 12           | 0.7333                 | 0.6471              | 0.6875          | 17                  | 0.8333                 | 0.8333              | 0.8333          | 6                   | 1.0                   | 0.75               | 0.8571         | 4                  | 1.0                      | 1.0                   | 1.0               | 2                     | 0.7870                | 0.8947             | 0.8374         | 95                 | 0.0                  | 0.0               | 0.0           | 3                 | 0.0                  | 0.0               | 0.0           | 1                 | 0.7714           | 0.7875        | 0.7794    | 240           | 0.9928              | 1.0              | 0.9964       | 274              | 0.8653            | 0.8691         | 0.8672     | 0.9566           |
 ### Framework versions

 This model is a fine-tuned version of [cointegrated/rubert-tiny2](https://huggingface.co/cointegrated/rubert-tiny2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1428
+- Addressee Precision: 0.6364
 - Addressee Recall: 0.875
+- Addressee F1: 0.7368
 - Addressee Number: 8
 - Benefactive Precision: 0.0
 - Benefactive Recall: 0.0
 - Benefactive F1: 0.0
 - Benefactive Number: 2
+- Causator Precision: 0.9286
 - Causator Recall: 0.8125
+- Causator F1: 0.8667
 - Causator Number: 16
+- Cause Precision: 0.6
+- Cause Recall: 0.25
+- Cause F1: 0.3529
 - Cause Number: 12
+- Contrsubject Precision: 0.6364
+- Contrsubject Recall: 0.4118
+- Contrsubject F1: 0.5
 - Contrsubject Number: 17
+- Deliberative Precision: 1.0
+- Deliberative Recall: 0.6667
+- Deliberative F1: 0.8
 - Deliberative Number: 6
 - Destinative Precision: 1.0
+- Destinative Recall: 0.5
+- Destinative F1: 0.6667
 - Destinative Number: 4
 - Directivefinal Precision: 1.0
 - Directivefinal Recall: 1.0
 - Directivefinal F1: 1.0
 - Directivefinal Number: 2
+- Experiencer Precision: 0.8018
+- Experiencer Recall: 0.9368
+- Experiencer F1: 0.8641
 - Experiencer Number: 95
 - Instrument Precision: 0.0
 - Instrument Recall: 0.0
 - Limitative Recall: 0.0
 - Limitative F1: 0.0
 - Limitative Number: 1
+- Object Precision: 0.7589
+- Object Recall: 0.8
+- Object F1: 0.7789
 - Object Number: 240
+- Overall Precision: 0.7724
+- Overall Recall: 0.7857
+- Overall F1: 0.7790
+- Overall Accuracy: 0.9589
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 8.017672397578385e-05
+- train_batch_size: 4
 - eval_batch_size: 1
+- seed: 678943
 - gradient_accumulation_steps: 4
+- total_train_batch_size: 16
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.04
+- num_epochs: 4
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Addressee Precision | Addressee Recall | Addressee F1 | Addressee Number | Benefactive Precision | Benefactive Recall | Benefactive F1 | Benefactive Number | Causator Precision | Causator Recall | Causator F1 | Causator Number | Cause Precision | Cause Recall | Cause F1 | Cause Number | Contrsubject Precision | Contrsubject Recall | Contrsubject F1 | Contrsubject Number | Deliberative Precision | Deliberative Recall | Deliberative F1 | Deliberative Number | Destinative Precision | Destinative Recall | Destinative F1 | Destinative Number | Directivefinal Precision | Directivefinal Recall | Directivefinal F1 | Directivefinal Number | Experiencer Precision | Experiencer Recall | Experiencer F1 | Experiencer Number | Instrument Precision | Instrument Recall | Instrument F1 | Instrument Number | Limitative Precision | Limitative Recall | Limitative F1 | Limitative Number | Object Precision | Object Recall | Object F1 | Object Number | Overall Precision | Overall Recall | Overall F1 | Overall Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:-------------------:|:----------------:|:------------:|:----------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------:|:---------------:|:-----------:|:---------------:|:---------------:|:------------:|:--------:|:------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:----------------------:|:-------------------:|:---------------:|:-------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:------------------------:|:---------------------:|:-----------------:|:---------------------:|:---------------------:|:------------------:|:--------------:|:------------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:--------------------:|:-----------------:|:-------------:|:-----------------:|:----------------:|:-------------:|:---------:|:-------------:|:-----------------:|:--------------:|:----------:|:----------------:|
+| 0.2206        | 1.0   | 490  | 0.1959          | 0.6667              | 0.25             | 0.3636       | 8                | 0.0                   | 0.0                | 0.0            | 2                  | 0.8667             | 0.8125          | 0.8387      | 16              | 0.0             | 0.0          | 0.0      | 12           | 1.0                    | 0.0588              | 0.1111          | 17                  | 0.0                    | 0.0                 | 0.0             | 6                   | 0.0                   | 0.0                | 0.0            | 4                  | 0.0                      | 0.0                   | 0.0               | 2                     | 0.7203                | 0.8947             | 0.7981         | 95                 | 0.0                  | 0.0               | 0.0           | 3                 | 0.0                  | 0.0               | 0.0           | 1                 | 0.6692           | 0.725         | 0.696     | 240           | 0.6927            | 0.6773         | 0.6849     | 0.9445           |
+| 0.1507        | 2.0   | 981  | 0.1492          | 0.5556              | 0.625            | 0.5882       | 8                | 0.0                   | 0.0                | 0.0            | 2                  | 0.8667             | 0.8125          | 0.8387      | 16              | 0.6             | 0.25         | 0.3529   | 12           | 0.75                   | 0.3529              | 0.48            | 17                  | 1.0                    | 0.1667              | 0.2857          | 6                   | 1.0                   | 0.25               | 0.4            | 4                  | 1.0                      | 1.0                   | 1.0               | 2                     | 0.8646                | 0.8737             | 0.8691         | 95                 | 0.0                  | 0.0               | 0.0           | 3                 | 0.0                  | 0.0               | 0.0           | 1                 | 0.7635           | 0.7667        | 0.7651    | 240           | 0.7884            | 0.7340         | 0.7602     | 0.9566           |
+| 0.1146        | 3.0   | 1472 | 0.1437          | 0.6364              | 0.875            | 0.7368       | 8                | 0.0                   | 0.0                | 0.0            | 2                  | 0.9286             | 0.8125          | 0.8667      | 16              | 0.6             | 0.25         | 0.3529   | 12           | 0.6429                 | 0.5294              | 0.5806          | 17                  | 1.0                    | 0.5                 | 0.6667          | 6                   | 1.0                   | 0.5                | 0.6667         | 4                  | 1.0                      | 1.0                   | 1.0               | 2                     | 0.8                   | 0.9263             | 0.8585         | 95                 | 0.0                  | 0.0               | 0.0           | 3                 | 0.0                  | 0.0               | 0.0           | 1                 | 0.7443           | 0.8125        | 0.7769    | 240           | 0.7612            | 0.7931         | 0.7768     | 0.9584           |
+| 0.0842        | 3.99  | 1960 | 0.1428          | 0.6364              | 0.875            | 0.7368       | 8                | 0.0                   | 0.0                | 0.0            | 2                  | 0.9286             | 0.8125          | 0.8667      | 16              | 0.6             | 0.25         | 0.3529   | 12           | 0.6364                 | 0.4118              | 0.5             | 17                  | 1.0                    | 0.6667              | 0.8             | 6                   | 1.0                   | 0.5                | 0.6667         | 4                  | 1.0                      | 1.0                   | 1.0               | 2                     | 0.8018                | 0.9368             | 0.8641         | 95                 | 0.0                  | 0.0               | 0.0           | 3                 | 0.0                  | 0.0               | 0.0           | 1                 | 0.7589           | 0.8           | 0.7789    | 240           | 0.7724            | 0.7857         | 0.7790     | 0.9589           |
 ### Framework versions

config.json CHANGED Viewed

@@ -12,54 +12,52 @@
   "hidden_size": 312,
   "id2label": {
     "0": "O",
-    "1": "B-Predicate",
-    "2": "B-Object",
-    "3": "B-Experiencer",
-    "4": "B-Cause",
-    "5": "B-Deliberative",
-    "6": "B-Causator",
-    "7": "B-ContrSubject",
-    "8": "B-Benefactive",
-    "9": "B-Addressee",
-    "10": "I-Object",
-    "11": "B-Destinative",
-    "12": "I-ContrSubject",
-    "13": "B-Instrument",
-    "14": "I-Deliberative",
-    "15": "B-Limitative",
-    "16": "B-DirectiveFinal",
-    "17": "B-Mediative",
-    "18": "I-DirectiveFinal",
-    "19": "B-DirectiveInitial",
-    "20": "I-DirectiveInitial",
-    "21": "I-Experiencer",
-    "22": "I-Cause"
   },
   "initializer_range": 0.02,
   "intermediate_size": 600,
   "label2id": {
-    "B-Addressee": 9,
-    "B-Benefactive": 8,
-    "B-Causator": 6,
-    "B-Cause": 4,
-    "B-ContrSubject": 7,
-    "B-Deliberative": 5,
-    "B-Destinative": 11,
-    "B-DirectiveFinal": 16,
-    "B-DirectiveInitial": 19,
-    "B-Experiencer": 3,
-    "B-Instrument": 13,
-    "B-Limitative": 15,
-    "B-Mediative": 17,
-    "B-Object": 2,
-    "B-Predicate": 1,
-    "I-Cause": 22,
-    "I-ContrSubject": 12,
-    "I-Deliberative": 14,
-    "I-DirectiveFinal": 18,
-    "I-DirectiveInitial": 20,
-    "I-Experiencer": 21,
-    "I-Object": 10,
     "O": 0
   },
   "layer_norm_eps": 1e-12,

   "hidden_size": 312,
   "id2label": {
     "0": "O",
+    "1": "B-Object",
+    "2": "B-Experiencer",
+    "3": "B-Cause",
+    "4": "B-Deliberative",
+    "5": "B-Causator",
+    "6": "B-ContrSubject",
+    "7": "B-Benefactive",
+    "8": "B-Addressee",
+    "9": "I-Object",
+    "10": "B-Destinative",
+    "11": "I-ContrSubject",
+    "12": "B-Instrument",
+    "13": "I-Deliberative",
+    "14": "B-Limitative",
+    "15": "B-DirectiveFinal",
+    "16": "B-Mediative",
+    "17": "I-DirectiveFinal",
+    "18": "B-DirectiveInitial",
+    "19": "I-DirectiveInitial",
+    "20": "I-Experiencer",
+    "21": "I-Cause"
   },
   "initializer_range": 0.02,
   "intermediate_size": 600,
   "label2id": {
+    "B-Addressee": 8,
+    "B-Benefactive": 7,
+    "B-Causator": 5,
+    "B-Cause": 3,
+    "B-ContrSubject": 6,
+    "B-Deliberative": 4,
+    "B-Destinative": 10,
+    "B-DirectiveFinal": 15,
+    "B-DirectiveInitial": 18,
+    "B-Experiencer": 2,
+    "B-Instrument": 12,
+    "B-Limitative": 14,
+    "B-Mediative": 16,
+    "B-Object": 1,
+    "I-Cause": 21,
+    "I-ContrSubject": 11,
+    "I-Deliberative": 13,
+    "I-DirectiveFinal": 17,
+    "I-DirectiveInitial": 19,
+    "I-Experiencer": 20,
+    "I-Object": 9,
     "O": 0
   },
   "layer_norm_eps": 1e-12,

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:88625d7c7d464a0afa4da7a3df8db8b6a68c9824280018291ff6cfa4ac8cf937
-size 116430550

 version https://git-lfs.github.com/spec/v1
+oid sha256:d557ee951a120b38ed50084bcfead5e3c16fae4e582949932a8e354cdfff96cc
+size 116429334

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:92bccd8424bd2b44800b16c85d657112a9c10b5e891d03dc6382bd1c44d6e5d8
-size 4091

 version https://git-lfs.github.com/spec/v1
+oid sha256:2a0dd85f6470330f97378a28fa85c857715de3b90e12ee9351662122d02a9901
+size 4155