Dagobert42/xlnet-base-cased-biored-augmented

@@ -28,12 +28,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [xlnet-base-cased](https://huggingface.co/xlnet-base-cased) on the bigbio/biored dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1676
-- Accuracy: 0.9551
-- Precision: 0.8878
-- Recall: 0.847
-- F1: 0.8624
-- Weighted F1: 0.9551
 ## Model description
@@ -52,7 +52,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 1.8e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
@@ -64,16 +64,14 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     | Weighted F1 |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|:-----------:|
-| No log        | 1.0   | 20   | 0.2094          | 0.9301   | 0.7924    | 0.7914 | 0.7886 | 0.9301      |
-| No log        | 2.0   | 40   | 0.1915          | 0.9391   | 0.8202    | 0.8032 | 0.808  | 0.9383      |
-| No log        | 3.0   | 60   | 0.1901          | 0.9425   | 0.8239    | 0.8169 | 0.8197 | 0.9418      |
-| No log        | 4.0   | 80   | 0.1872          | 0.9461   | 0.8361    | 0.8277 | 0.8304 | 0.9453      |
-| No log        | 5.0   | 100  | 0.2001          | 0.9455   | 0.8269    | 0.8251 | 0.8245 | 0.9448      |
-| No log        | 6.0   | 120  | 0.2063          | 0.9462   | 0.845     | 0.8288 | 0.8354 | 0.9457      |
-| No log        | 7.0   | 140  | 0.2081          | 0.9458   | 0.8153    | 0.8353 | 0.8235 | 0.9458      |
-| No log        | 8.0   | 160  | 0.2274          | 0.9454   | 0.8192    | 0.8329 | 0.8245 | 0.9452      |
-| No log        | 9.0   | 180  | 0.2286          | 0.9475   | 0.8298    | 0.8332 | 0.8303 | 0.9471      |
-| No log        | 10.0  | 200  | 0.2404          | 0.9473   | 0.8352    | 0.83   | 0.8314 | 0.9467      |
 ### Framework versions

 This model is a fine-tuned version of [xlnet-base-cased](https://huggingface.co/xlnet-base-cased) on the bigbio/biored dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1576
+- Accuracy: 0.9544
+- Precision: 0.8802
+- Recall: 0.858
+- F1: 0.8663
+- Weighted F1: 0.9546
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1.5e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | Precision | Recall | F1     | Weighted F1 |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:---------:|:------:|:------:|:-----------:|
+| No log        | 1.0   | 20   | 0.2001          | 0.9348   | 0.8286    | 0.7628 | 0.791  | 0.9332      |
+| No log        | 2.0   | 40   | 0.1961          | 0.9367   | 0.7938    | 0.8119 | 0.8015 | 0.9365      |
+| No log        | 3.0   | 60   | 0.1902          | 0.9422   | 0.8297    | 0.8124 | 0.8202 | 0.9416      |
+| No log        | 4.0   | 80   | 0.1948          | 0.9426   | 0.8323    | 0.8226 | 0.8269 | 0.9422      |
+| No log        | 5.0   | 100  | 0.1969          | 0.9429   | 0.8152    | 0.8279 | 0.8208 | 0.9431      |
+| No log        | 6.0   | 120  | 0.2071          | 0.9426   | 0.8194    | 0.8324 | 0.8257 | 0.943       |
+| No log        | 7.0   | 140  | 0.2024          | 0.9455   | 0.8244    | 0.8284 | 0.8258 | 0.9453      |
+| No log        | 8.0   | 160  | 0.2143          | 0.9451   | 0.8241    | 0.8294 | 0.8257 | 0.9449      |
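
The added rows log eight epochs. A minimal sketch for reading them programmatically and picking out the epochs with the lowest validation loss and the highest F1 follows. The column keys and the `parse_rows` helper are illustrative names, not part of the model card or of any library, and the rows are copied from the added lines above with the padding collapsed.

```python
# Rows copied from the added ("+") side of the training-results table.
ROWS = """
| No log | 1.0 | 20  | 0.2001 | 0.9348 | 0.8286 | 0.7628 | 0.791  | 0.9332 |
| No log | 2.0 | 40  | 0.1961 | 0.9367 | 0.7938 | 0.8119 | 0.8015 | 0.9365 |
| No log | 3.0 | 60  | 0.1902 | 0.9422 | 0.8297 | 0.8124 | 0.8202 | 0.9416 |
| No log | 4.0 | 80  | 0.1948 | 0.9426 | 0.8323 | 0.8226 | 0.8269 | 0.9422 |
| No log | 5.0 | 100 | 0.1969 | 0.9429 | 0.8152 | 0.8279 | 0.8208 | 0.9431 |
| No log | 6.0 | 120 | 0.2071 | 0.9426 | 0.8194 | 0.8324 | 0.8257 | 0.943  |
| No log | 7.0 | 140 | 0.2024 | 0.9455 | 0.8244 | 0.8284 | 0.8258 | 0.9453 |
| No log | 8.0 | 160 | 0.2143 | 0.9451 | 0.8241 | 0.8294 | 0.8257 | 0.9449 |
"""

# Hypothetical key names, one per table column, in table order.
COLUMNS = [
    "training_loss", "epoch", "step", "validation_loss",
    "accuracy", "precision", "recall", "f1", "weighted_f1",
]


def parse_rows(text):
    """Turn markdown table rows into dicts; every column but the first is numeric."""
    rows = []
    for line in text.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        row = dict(zip(COLUMNS, cells))
        for name in COLUMNS[1:]:
            row[name] = float(row[name])
        rows.append(row)
    return rows


rows = parse_rows(ROWS)
best_loss = min(rows, key=lambda r: r["validation_loss"])
best_f1 = max(rows, key=lambda r: r["f1"])

print(best_loss["epoch"], best_loss["validation_loss"])  # 3.0 0.1902
print(best_f1["epoch"], best_f1["f1"])                   # 4.0 0.8269
```

In this run the two criteria select different epochs: validation loss is lowest at epoch 3, while F1 peaks at epoch 4.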
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6fc847eeef8ba0fc8a3adff9f6e6a20c8d8dc83b149570197b4e5d667ef97849
 size 466917412

 version https://git-lfs.github.com/spec/v1
+oid sha256:38a73cc9e217b8b60675df8c2a4c0c5733837592ce2ce1f85fa2e0f7e71265f7
 size 466917412
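
Both sides of this hunk are Git LFS pointer files: three `key value` lines that record the spec version, the SHA-256 of the real file, and its size in bytes. Here the size is unchanged and only the `oid` differs. The sketch below parses such a pointer and checks a byte string against it; `parse_lfs_pointer` and `matches_pointer` are illustrative helper names, not part of Git LFS or any library.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict of strings."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


def matches_pointer(pointer, data):
    """Return True if `data` has the size and SHA-256 digest the pointer records."""
    algo, _, digest = pointer["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported hash algorithm: {algo}")
    same_size = len(data) == int(pointer["size"])
    same_hash = hashlib.sha256(data).hexdigest() == digest
    return same_size and same_hash


# New pointer for model.safetensors, copied from the added side of the diff.
new_pointer = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:38a73cc9e217b8b60675df8c2a4c0c5733837592ce2ce1f85fa2e0f7e71265f7
size 466917412
""")

print(new_pointer["oid"])
print(int(new_pointer["size"]))
```

`matches_pointer` takes the whole file as bytes for brevity. For a file of this size (about 467 MB) it would be more practical to feed `hashlib.sha256` in chunks via its `update` method while reading from disk.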

tokenizer.json CHANGED

@@ -6,16 +6,7 @@
     "strategy": "LongestFirst",
     "stride": 0
   },
-  "padding": {
-    "strategy": {
-      "Fixed": 512
-    },
-    "direction": "Left",
-    "pad_to_multiple_of": null,
-    "pad_id": 5,
-    "pad_type_id": 3,
-    "pad_token": "<pad>"
-  },
   "added_tokens": [
     {
       "id": 0,

     "strategy": "LongestFirst",
     "stride": 0
   },
+  "padding": null,
   "added_tokens": [
     {
       "id": 0,
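
This hunk replaces a stored padding configuration (fixed length 512, left-padded with `<pad>`) with `"padding": null`, which presumably leaves the padding choice to whoever calls the tokenizer. As a rough illustration using only the standard library, the sketch below summarizes the `padding` entry of a parsed `tokenizer.json`. `describe_padding` is a hypothetical helper, and the two JSON fragments contain only the fields shown in the diff.

```python
import json


def describe_padding(tokenizer_config):
    """Summarize the top-level "padding" entry of a parsed tokenizer.json."""
    padding = tokenizer_config.get("padding")
    if padding is None:
        return "no padding stored"
    strategy = padding["strategy"]
    # The diff serializes a fixed length as {"Fixed": 512}. Other strategy
    # encodings do not appear there, so they are passed through unchanged.
    if isinstance(strategy, dict) and "Fixed" in strategy:
        length = f"fixed length {strategy['Fixed']}"
    else:
        length = f"strategy {strategy}"
    return (
        f"{length}, {padding['direction'].lower()}-padded "
        f"with {padding['pad_token']!r} (id {padding['pad_id']})"
    )


# Removed side of the hunk.
old = json.loads(
    '{"padding": {"strategy": {"Fixed": 512}, "direction": "Left", '
    '"pad_to_multiple_of": null, "pad_id": 5, "pad_type_id": 3, '
    '"pad_token": "<pad>"}}'
)
# Added side of the hunk.
new = json.loads('{"padding": null}')

print(describe_padding(old))  # fixed length 512, left-padded with '<pad>' (id 5)
print(describe_padding(new))  # no padding stored
```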

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:64e325d07f2a1379926408048773612769801fda2abced01956b307b5d668c19
 size 4219

 version https://git-lfs.github.com/spec/v1
+oid sha256:015e17f0f2bafab9d81d61e5dfd4433b10c65c6fda6d23e857668b07ec8838d2
 size 4219