aneespatka
/

modernbert-llm-sentiment

Text Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

aneespatka commited on 17 days ago

Commit

19b2bb8

·

verified ·

1 Parent(s): 258042b

End of training

Files changed (2) hide show

README.md +5 -8
tokenizer.json +16 -2

README.md CHANGED Viewed

@@ -44,17 +44,14 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | F1     |
-|:-------------:|:-----:|:-----:|:---------------:|:------:|
-| 0.0           | 1.0   | 3827  | nan             | 0.2648 |
-| 0.0           | 2.0   | 7654  | nan             | 0.2648 |
-| 0.0           | 3.0   | 11481 | nan             | 0.2648 |
-| 0.0           | 4.0   | 15308 | nan             | 0.2648 |
-| 0.0           | 5.0   | 19135 | nan             | 0.2648 |
 ### Framework versions

 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH_FUSED with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
+- num_epochs: 2
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1     |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.0           | 1.0   | 3827 | nan             | 0.2648 |
+| 0.0           | 2.0   | 7654 | nan             | 0.2648 |
 ### Framework versions

tokenizer.json CHANGED Viewed

@@ -1,7 +1,21 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 512,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 512
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 50283,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
   "added_tokens": [
     {
       "id": 0,