tcapelle committed · commit 30b10c4 · verified · 1 Parent(s): f7ca40e

Model save

Files changed (1): README.md (+15 −15)
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [answerdotai/ModernBERT-base](https://huggingface.co/answerdotai/ModernBERT-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5555
-- F1: 0.6007
-- Accuracy: 0.696
-- Precision: 0.6271
-- Recall: 0.696
+- Loss: 0.3830
+- F1: 0.8183
+- Accuracy: 0.8212
+- Precision: 0.8171
+- Recall: 0.8212
 
 ## Model description
 
@@ -45,8 +45,8 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 3e-05
-- train_batch_size: 128
-- eval_batch_size: 128
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: cosine
@@ -55,17 +55,17 @@ The following hyperparameters were used during training:
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | F1     | Accuracy | Precision | Recall |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:--------:|:---------:|:------:|
-| No log        | 0     | 0    | 0.8275          | 0.2749 | 0.336    | 0.5265    | 0.336  |
-| No log        | 1.0   | 8    | 0.6056          | 0.6355 | 0.669    | 0.6265    | 0.669  |
-| 0.7136        | 2.0   | 16   | 0.5566          | 0.6004 | 0.693    | 0.6178    | 0.693  |
-| 0.5579        | 3.0   | 24   | 0.5555          | 0.6007 | 0.696    | 0.6271    | 0.696  |
+| Training Loss | Epoch | Step  | Validation Loss | F1     | Accuracy | Precision | Recall |
+|:-------------:|:-----:|:-----:|:---------------:|:------:|:--------:|:---------:|:------:|
+| No log        | 0     | 0     | 0.7214          | 0.5368 | 0.5168   | 0.6201    | 0.5168 |
+| 0.5801        | 1.0   | 6158  | 0.4019          | 0.8069 | 0.8092   | 0.8056    | 0.8092 |
+| 0.4354        | 2.0   | 12316 | 0.3835          | 0.8176 | 0.8212   | 0.8165    | 0.8212 |
+| 0.4089        | 3.0   | 18474 | 0.3830          | 0.8183 | 0.8212   | 0.8171    | 0.8212 |
 
 
 ### Framework versions
 
 - Transformers 4.48.1
-- Pytorch 2.5.1+cu124
-- Datasets 3.2.0
+- Pytorch 2.4.1+cu121
+- Datasets 3.0.1
 - Tokenizers 0.21.0
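A side note on the metrics in both versions of the table: Recall always equals Accuracy (0.696/0.696 before, 0.8212/0.8212 after). That is the expected behaviour when recall is support-weighted across classes, since the support weights cancel and the weighted recall reduces to overall accuracy. The card does not state the averaging mode, so this is an inference; a minimal pure-Python sketch with toy labels (not the model's data):

```python
from collections import Counter

def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def weighted_recall(y_true, y_pred):
    """Support-weighted mean of per-class recall.

    sum_c (support_c / N) * (TP_c / support_c) = (sum_c TP_c) / N,
    which is exactly the accuracy.
    """
    n = len(y_true)
    total = 0.0
    for cls, support in Counter(y_true).items():
        tp = sum(t == p == cls for t, p in zip(y_true, y_pred))
        total += (support / n) * (tp / support)
    return total

# Toy 3-class example: the two quantities coincide.
y_true = [0, 0, 1, 1, 1, 2]
y_pred = [0, 1, 1, 1, 0, 2]
assert abs(accuracy(y_true, y_pred) - weighted_recall(y_true, y_pred)) < 1e-12
```

If the numbers had come from macro averaging instead, Recall and Accuracy would generally differ whenever the classes are imbalanced.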