Initial Commit

Files changed (4)

README.md +42 -42
eval_result_ner.json +1 -1
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -1,14 +1,14 @@
 ---
-base_model: microsoft/mdeberta-v3-base
 library_name: transformers
 license: mit
 metrics:
 - precision
 - recall
 - f1
 - accuracy
-tags:
-- generated_from_trainer
 model-index:
 - name: scenario-kd-pre-ner-full-mdeberta_data-univner_full44
   results: []
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2650
-- Precision: 0.8107
-- Recall: 0.8117
-- F1: 0.8112
-- Accuracy: 0.9806
 ## Model description
@@ -58,40 +58,40 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step  | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:------:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 1.3559        | 0.2911 | 500   | 0.7891          | 0.4246    | 0.3525 | 0.3852 | 0.9433   |
-| 0.7017        | 0.5822 | 1000  | 0.5422          | 0.6316    | 0.6298 | 0.6307 | 0.9646   |
-| 0.528         | 0.8732 | 1500  | 0.4693          | 0.6856    | 0.6830 | 0.6843 | 0.9692   |
-| 0.4354        | 1.1643 | 2000  | 0.4211          | 0.7101    | 0.7376 | 0.7236 | 0.9724   |
-| 0.385         | 1.4554 | 2500  | 0.3893          | 0.7482    | 0.7374 | 0.7428 | 0.9747   |
-| 0.3575        | 1.7465 | 3000  | 0.3713          | 0.7678    | 0.7331 | 0.7500 | 0.9752   |
-| 0.3298        | 2.0375 | 3500  | 0.3550          | 0.7497    | 0.7800 | 0.7645 | 0.9761   |
-| 0.2879        | 2.3286 | 4000  | 0.3492          | 0.7964    | 0.7367 | 0.7654 | 0.9763   |
-| 0.2748        | 2.6197 | 4500  | 0.3272          | 0.7660    | 0.7924 | 0.7790 | 0.9782   |
-| 0.2644        | 2.9108 | 5000  | 0.3192          | 0.7817    | 0.7811 | 0.7814 | 0.9779   |
-| 0.2416        | 3.2019 | 5500  | 0.3239          | 0.8004    | 0.7681 | 0.7839 | 0.9782   |
-| 0.2303        | 3.4929 | 6000  | 0.3085          | 0.7846    | 0.7966 | 0.7905 | 0.9787   |
-| 0.2252        | 3.7840 | 6500  | 0.3051          | 0.7973    | 0.7883 | 0.7928 | 0.9787   |
-| 0.2159        | 4.0751 | 7000  | 0.3045          | 0.7987    | 0.7908 | 0.7948 | 0.9790   |
-| 0.2067        | 4.3662 | 7500  | 0.2979          | 0.7969    | 0.7943 | 0.7956 | 0.9793   |
-| 0.2028        | 4.6573 | 8000  | 0.2924          | 0.7855    | 0.8132 | 0.7991 | 0.9792   |
-| 0.1985        | 4.9483 | 8500  | 0.2904          | 0.8008    | 0.7986 | 0.7997 | 0.9791   |
-| 0.1867        | 5.2394 | 9000  | 0.2884          | 0.8       | 0.8033 | 0.8017 | 0.9797   |
-| 0.1838        | 5.5305 | 9500  | 0.2841          | 0.7997    | 0.8220 | 0.8107 | 0.9800   |
-| 0.1838        | 5.8216 | 10000 | 0.2810          | 0.7895    | 0.8165 | 0.8028 | 0.9798   |
-| 0.1786        | 6.1126 | 10500 | 0.2767          | 0.8065    | 0.8150 | 0.8108 | 0.9802   |
-| 0.1719        | 6.4037 | 11000 | 0.2790          | 0.8133    | 0.8057 | 0.8095 | 0.9803   |
-| 0.1706        | 6.6948 | 11500 | 0.2795          | 0.8140    | 0.7983 | 0.8061 | 0.9802   |
-| 0.1695        | 6.9859 | 12000 | 0.2723          | 0.8124    | 0.8121 | 0.8123 | 0.9807   |
-| 0.1638        | 7.2770 | 12500 | 0.2726          | 0.8070    | 0.8078 | 0.8074 | 0.9803   |
-| 0.162         | 7.5680 | 13000 | 0.2724          | 0.8118    | 0.8173 | 0.8146 | 0.9807   |
-| 0.1619        | 7.8591 | 13500 | 0.2678          | 0.8018    | 0.8235 | 0.8125 | 0.9805   |
-| 0.1594        | 8.1502 | 14000 | 0.2719          | 0.8103    | 0.8068 | 0.8086 | 0.9800   |
-| 0.1571        | 8.4413 | 14500 | 0.2688          | 0.8097    | 0.8127 | 0.8112 | 0.9805   |
-| 0.1585        | 8.7324 | 15000 | 0.2673          | 0.8126    | 0.8150 | 0.8138 | 0.9806   |
-| 0.1546        | 9.0234 | 15500 | 0.2658          | 0.8105    | 0.8120 | 0.8112 | 0.9805   |
-| 0.1534        | 9.3145 | 16000 | 0.2652          | 0.8101    | 0.8198 | 0.8149 | 0.9807   |
-| 0.1535        | 9.6056 | 16500 | 0.2646          | 0.8097    | 0.8140 | 0.8119 | 0.9807   |
-| 0.1531        | 9.8967 | 17000 | 0.2650          | 0.8107    | 0.8117 | 0.8112 | 0.9806   |
 ### Framework versions

 ---
 library_name: transformers
 license: mit
+base_model: microsoft/mdeberta-v3-base
+tags:
+- generated_from_trainer
 metrics:
 - precision
 - recall
 - f1
 - accuracy
 model-index:
 - name: scenario-kd-pre-ner-full-mdeberta_data-univner_full44
   results: []
 This model is a fine-tuned version of [microsoft/mdeberta-v3-base](https://huggingface.co/microsoft/mdeberta-v3-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 46.6459
+- Precision: 0.8272
+- Recall: 0.8335
+- F1: 0.8303
+- Accuracy: 0.9822
 ## Model description
 | Training Loss | Epoch  | Step  | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:------:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 149.9231      | 0.2911 | 500   | 110.8748        | 0.5070    | 0.1208 | 0.1951 | 0.9347   |
+| 101.1776      | 0.5822 | 1000  | 93.5222         | 0.7123    | 0.6586 | 0.6844 | 0.9701   |
+| 89.9222       | 0.8732 | 1500  | 86.5450         | 0.7487    | 0.7276 | 0.7380 | 0.9748   |
+| 83.6167       | 1.1643 | 2000  | 81.4135         | 0.7818    | 0.7501 | 0.7656 | 0.9769   |
+| 78.8225       | 1.4554 | 2500  | 77.5955         | 0.7905    | 0.7547 | 0.7722 | 0.9777   |
+| 75.3094       | 1.7465 | 3000  | 74.1885         | 0.7825    | 0.7798 | 0.7812 | 0.9783   |
+| 71.9149       | 2.0375 | 3500  | 71.4168         | 0.7893    | 0.8020 | 0.7956 | 0.9790   |
+| 68.8017       | 2.3286 | 4000  | 68.6904         | 0.8194    | 0.7778 | 0.7981 | 0.9794   |
+| 66.2935       | 2.6197 | 4500  | 66.3018         | 0.7981    | 0.8070 | 0.8025 | 0.9802   |
+| 64.1282       | 2.9108 | 5000  | 64.3227         | 0.7988    | 0.8130 | 0.8059 | 0.9803   |
+| 61.983        | 3.2019 | 5500  | 62.6362         | 0.8141    | 0.8114 | 0.8128 | 0.9808   |
+| 60.0914       | 3.4929 | 6000  | 60.8145         | 0.8106    | 0.8149 | 0.8127 | 0.9808   |
+| 58.497        | 3.7840 | 6500  | 59.2819         | 0.8126    | 0.8158 | 0.8142 | 0.9812   |
+| 57.0173       | 4.0751 | 7000  | 58.0187         | 0.8126    | 0.7990 | 0.8058 | 0.9804   |
+| 55.5793       | 4.3662 | 7500  | 56.7794         | 0.8033    | 0.8240 | 0.8135 | 0.9808   |
+| 54.4031       | 4.6573 | 8000  | 55.5089         | 0.8072    | 0.8287 | 0.8178 | 0.9812   |
+| 53.2147       | 4.9483 | 8500  | 54.5450         | 0.8128    | 0.8094 | 0.8111 | 0.9810   |
+| 52.0438       | 5.2394 | 9000  | 53.6043         | 0.8145    | 0.8222 | 0.8184 | 0.9814   |
+| 51.102        | 5.5305 | 9500  | 52.6326         | 0.8100    | 0.8261 | 0.818  | 0.9811   |
+| 50.3841       | 5.8216 | 10000 | 51.8428         | 0.8138    | 0.8300 | 0.8219 | 0.9815   |
+| 49.4812       | 6.1126 | 10500 | 51.1615         | 0.8192    | 0.8296 | 0.8244 | 0.9819   |
+| 48.7273       | 6.4037 | 11000 | 50.4750         | 0.8156    | 0.8201 | 0.8178 | 0.9813   |
+| 48.1157       | 6.6948 | 11500 | 49.8869         | 0.8190    | 0.8259 | 0.8224 | 0.9818   |
+| 47.4821       | 6.9859 | 12000 | 49.2946         | 0.8203    | 0.8279 | 0.8241 | 0.9819   |
+| 46.889        | 7.2770 | 12500 | 48.8428         | 0.8178    | 0.8224 | 0.8201 | 0.9816   |
+| 46.3939       | 7.5680 | 13000 | 48.3821         | 0.8264    | 0.8224 | 0.8244 | 0.9819   |
+| 46.0878       | 7.8591 | 13500 | 47.9867         | 0.8210    | 0.8272 | 0.8241 | 0.9817   |
+| 45.669        | 8.1502 | 14000 | 47.6715         | 0.8207    | 0.8257 | 0.8232 | 0.9818   |
+| 45.3064       | 8.4413 | 14500 | 47.3744         | 0.8167    | 0.8336 | 0.8251 | 0.9818   |
+| 45.0768       | 8.7324 | 15000 | 47.1812         | 0.8221    | 0.8235 | 0.8228 | 0.9821   |
+| 44.8212       | 9.0234 | 15500 | 46.9769         | 0.8172    | 0.8274 | 0.8223 | 0.9816   |
+| 44.6107       | 9.3145 | 16000 | 46.8141         | 0.8204    | 0.8298 | 0.8250 | 0.9819   |
+| 44.4495       | 9.6056 | 16500 | 46.7872         | 0.8189    | 0.8285 | 0.8236 | 0.9819   |
+| 44.51         | 9.8967 | 17000 | 46.6459         | 0.8272    | 0.8335 | 0.8303 | 0.9822   |
 ### Framework versions
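
The Precision, Recall, and F1 columns in both versions of the table can be cross-checked, since F1 is the harmonic mean of precision and recall. The sketch below recomputes F1 for the final row (step 17000) of the old and new tables. The helper name `f1_from_pr` is hypothetical, and the check assumes the card reports the standard F1 derived from the listed precision and recall; small differences are expected because the table values are rounded to four decimals.

```python
def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) copied from the step-17000 rows above.
rows = {
    "old, step 17000": (0.8107, 0.8117, 0.8112),
    "new, step 17000": (0.8272, 0.8335, 0.8303),
}

for label, (p, r, reported) in rows.items():
    computed = f1_from_pr(p, r)
    print(f"{label}: computed={computed:.4f} reported={reported:.4f}")
```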

eval_result_ner.json CHANGED

@@ -1 +1 @@

- {"ceb_gja": {"precision": 0.6323529411764706, "recall": 0.8775510204081632, "f1": 0.7350427350427351, "accuracy": 0.976061776061776}, "en_pud": {"precision": 0.7895771878072763, "recall": 0.7469767441860465, "f1": 0.7676864244741873, "accuracy": 0.9780884019644881}, "de_pud": {"precision": 0.7371375116931712, "recall": 0.7584215591915303, "f1": 0.7476280834914611, "accuracy": 0.9724344850217993}, "pt_pud": {"precision": 0.810928013876843, "recall": 0.8507734303912647, "f1": 0.830373001776199, "accuracy": 0.9829538172341608}, "ru_pud": {"precision": 0.6553930530164533, "recall": 0.6920849420849421, "f1": 0.6732394366197183, "accuracy": 0.9688452596228365}, "sv_pud": {"precision": 0.8186323092170465, "recall": 0.8027210884353742, "f1": 0.8105986261040234, "accuracy": 0.9815999161249738}, "tl_trg": {"precision": 0.6785714285714286, "recall": 0.8260869565217391, "f1": 0.7450980392156864, "accuracy": 0.9836512261580381}, "tl_ugnayan": {"precision": 0.5581395348837209, "recall": 0.7272727272727273, "f1": 0.6315789473684211, "accuracy": 0.9690063810391978}, "zh_gsd": {"precision": 0.7762148337595908, "recall": 0.7913950456323338, "f1": 0.7837314396384765, "accuracy": 0.9711122211122211}, "zh_gsdsimp": {"precision": 0.7893368010403121, "recall": 0.7955439056356488, "f1": 0.7924281984334205, "accuracy": 0.9726107226107226}, "hr_set": {"precision": 0.8732782369146006, "recall": 0.9037776193870278, "f1": 0.8882661996497373, "accuracy": 0.9865210222588623}, "da_ddt": {"precision": 0.8513189448441247, "recall": 0.7941834451901566, "f1": 0.8217592592592593, "accuracy": 0.9859323555821611}, "en_ewt": {"precision": 0.7971291866028708, "recall": 0.765625, "f1": 0.7810595405532114, "accuracy": 0.9782842570825199}, "pt_bosque": {"precision": 0.8327922077922078, "recall": 0.8444444444444444, "f1": 0.8385778504290968, "accuracy": 0.9854368932038835}, "sr_set": {"precision": 0.9155920281359906, "recall": 0.922077922077922, "f1": 0.9188235294117646, "accuracy": 0.9883547850450923}, "sk_snk": {"precision": 0.7766203703703703, "recall": 0.7333333333333333, "f1": 0.7543563799887577, "accuracy": 0.9663159547738693}, "sv_talbanken": {"precision": 0.8262910798122066, "recall": 0.8979591836734694, "f1": 0.8606356968215159, "accuracy": 0.9970555037542327}}

+ {"ceb_gja": {"precision": 0.5753424657534246, "recall": 0.8571428571428571, "f1": 0.6885245901639344, "accuracy": 0.9698841698841699}, "en_pud": {"precision": 0.7922330097087379, "recall": 0.7590697674418605, "f1": 0.7752969121140143, "accuracy": 0.9788439743105403}, "de_pud": {"precision": 0.7308039747064138, "recall": 0.7786333012512031, "f1": 0.7539608574091333, "accuracy": 0.9741690497398153}, "pt_pud": {"precision": 0.8458110516934046, "recall": 0.8635122838944495, "f1": 0.8545700135074291, "accuracy": 0.9861152646644167}, "ru_pud": {"precision": 0.6904541241890639, "recall": 0.7191119691119691, "f1": 0.7044917257683214, "accuracy": 0.9709119090674244}, "sv_pud": {"precision": 0.8397626112759644, "recall": 0.8250728862973761, "f1": 0.8323529411764706, "accuracy": 0.9838540574543929}, "tl_trg": {"precision": 0.625, "recall": 0.8695652173913043, "f1": 0.7272727272727273, "accuracy": 0.9809264305177112}, "tl_ugnayan": {"precision": 0.5476190476190477, "recall": 0.696969696969697, "f1": 0.6133333333333334, "accuracy": 0.9699179580674567}, "zh_gsd": {"precision": 0.7884130982367759, "recall": 0.8161668839634941, "f1": 0.8020499679692504, "accuracy": 0.973942723942724}, "zh_gsdsimp": {"precision": 0.8079385403329066, "recall": 0.8269986893840104, "f1": 0.817357512953368, "accuracy": 0.9768564768564768}, "hr_set": {"precision": 0.8982456140350877, "recall": 0.9123307198859587, "f1": 0.9052333804809052, "accuracy": 0.9884171475680132}, "da_ddt": {"precision": 0.8530120481927711, "recall": 0.7919463087248322, "f1": 0.8213457076566124, "accuracy": 0.9864312082210915}, "en_ewt": {"precision": 0.8107317073170732, "recall": 0.7637867647058824, "f1": 0.7865593942262186, "accuracy": 0.9780850300832769}, "pt_bosque": {"precision": 0.8609271523178808, "recall": 0.8559670781893004, "f1": 0.8584399504746181, "accuracy": 0.9861976525141284}, "sr_set": {"precision": 0.9268867924528302, "recall": 0.9279811097992916, "f1": 0.927433628318584, "accuracy": 0.9893179231240697}, "sk_snk": {"precision": 0.796037296037296, "recall": 0.746448087431694, "f1": 0.7704455724760293, "accuracy": 0.9689070351758794}, "sv_talbanken": {"precision": 0.8372093023255814, "recall": 0.9183673469387755, "f1": 0.8759124087591241, "accuracy": 0.9974971781910978}}
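
Because the file is a single JSON object keyed by dataset name, the two versions are easier to compare programmatically than by eye. The sketch below is a minimal example, assuming both revisions have been saved locally (the file paths passed to `load_results` are up to the reader; the helper names `load_results`, `macro_average`, and `f1_deltas` are hypothetical). The inline sample uses two entries copied from the diff above: `ceb_gja`, whose F1 drops, and `sr_set`, whose F1 rises.

```python
import json
from statistics import mean


def load_results(path: str) -> dict:
    """Load a per-dataset results file shaped like eval_result_ner.json."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def macro_average(results: dict, metric: str = "f1") -> float:
    """Unweighted mean of one metric across all datasets."""
    return mean(scores[metric] for scores in results.values())


def f1_deltas(old: dict, new: dict) -> dict:
    """Per-dataset change in F1 (new minus old) for datasets present in both."""
    return {name: new[name]["f1"] - old[name]["f1"] for name in old if name in new}


# Two entries copied from the old and new versions of the file (F1 only).
old_sample = {
    "ceb_gja": {"f1": 0.7350427350427351},
    "sr_set": {"f1": 0.9188235294117646},
}
new_sample = {
    "ceb_gja": {"f1": 0.6885245901639344},
    "sr_set": {"f1": 0.927433628318584},
}

for name, delta in sorted(f1_deltas(old_sample, new_sample).items()):
    print(f"{name}: {delta:+.4f}")
print(f"macro F1 (new sample): {macro_average(new_sample):.4f}")
```

A macro average weights every dataset equally, so small test sets such as `ceb_gja` move it as much as large ones; it is not directly comparable to the single evaluation-set F1 reported in the model card.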

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4367c62c750054fb8da34a3a6c760e36475760449cae410331985f1215983f40
 size 944366708

 version https://git-lfs.github.com/spec/v1
+oid sha256:97884517040b4dd7d4866238e5a6a9c881fda40e6b3786dcdc2618388bcb0b01
 size 944366708

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6bebaa03afdc7d5fe605635f9cfd09c0d6985625bee0fdb3cb48c1978ebca472
 size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:9e957d70cc95206f48e8a18f5936433d0a5f0a912e3398ebd08ab86c0c84c52f
 size 5304
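
The `model.safetensors` and `training_args.bin` entries are Git LFS pointer files, so the diff shows only a changed `oid` while `size` stays the same. The following sketch parses such a pointer into its fields and compares the old and new `model.safetensors` pointers from this commit. The function name `parse_lfs_pointer` is hypothetical, and the parser assumes the three-line `version` / `oid` / `size` layout shown above.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest, size."""
    fields = {}
    for line in text.strip().splitlines():
        # Each line has the form "<key> <value>".
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the model.safetensors diff above.
old_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:4367c62c750054fb8da34a3a6c760e36475760449cae410331985f1215983f40
size 944366708
""")
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:97884517040b4dd7d4866238e5a6a9c881fda40e6b3786dcdc2618388bcb0b01
size 944366708
""")

print("same size:", old_pointer["size"] == new_pointer["size"])
print("same content hash:", old_pointer["digest"] == new_pointer["digest"])
```

An identical size with a different SHA-256 digest is consistent with retrained weights of the same architecture, though the pointer alone does not say what changed inside the file.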