End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -16,10 +16,10 @@ should probably proofread and complete it, then remove this comment. -->
 # modernbert-llm-router
-This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2857
-- F1: 0.9325
 ## Model description
@@ -50,11 +50,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 0.4844        | 1.0   | 313  | 0.4036          | 0.8962 |
-| 0.1605        | 2.0   | 626  | 0.3710          | 0.9036 |
-| 0.0319        | 3.0   | 939  | 0.2999          | 0.9238 |
-| 0.0103        | 4.0   | 1252 | 0.2893          | 0.9312 |
-| 0.0019        | 5.0   | 1565 | 0.2857          | 0.9325 |
 ### Framework versions

 # modernbert-llm-router
+This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.0729
+- F1: 0.7793
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.8558        | 1.0   | 394  | 0.7758          | 0.7761 |
+| 0.5148        | 2.0   | 788  | 0.7445          | 0.7702 |
+| 0.2466        | 3.0   | 1182 | 0.8782          | 0.7809 |
+| 0.1071        | 4.0   | 1576 | 1.0124          | 0.7789 |
+| 0.0579        | 5.0   | 1970 | 1.0729          | 0.7793 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f3f3c8d0c8d216eef92419d157932eeecf8ab79ab939664f37405a9c7a632933
 size 598476704

 version https://git-lfs.github.com/spec/v1
+oid sha256:7f1ce5a59604db282913163fabe015f5834ea8656e358fc22153a1f5233923a9
 size 598476704

runs/Jan13_12-14-09_5c2d25ea5850/events.out.tfevents.1736770450.5c2d25ea5850.441003.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6a3ae08b7a59a2848ad3fab75a2fa82d3e6f664ce3c2eb2f3f3a1e55d1c98a90
-size 11422

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ea019442ef268d3335ed16f916a93391eabebc9f1ed8739693ee668dd1ccacf
+size 12093

tokenizer.json CHANGED Viewed

@@ -1,7 +1,21 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 512,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 512
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 50283,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
   "added_tokens": [
     {
       "id": 0,