Training complete
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [FacebookAI/roberta-base](https://huggingface.co/FacebookAI/roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Precision: 0.
-- Recall: 0.
-- F1: 0.
-- Accuracy: 0.
+- Loss: 0.0616
+- Precision: 0.8439
+- Recall: 0.8346
+- F1: 0.8392
+- Accuracy: 0.9811
 
 ## Model description
 
@@ -44,32 +44,27 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate:
-- train_batch_size:
-- eval_batch_size:
+- learning_rate: 5e-05
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
-- gradient_accumulation_steps: 4
-- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.
-- num_epochs:
+- lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 2
 - mixed_precision_training: Native AMP
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
-|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| No log | 0
-
-| 0.1923 | 2.9990 | 758 | 0.0816 | 0.8109 | 0.7855 | 0.7980 | 0.9755 |
-| 0.0504 | 4.0 | 1011 | 0.0839 | 0.8073 | 0.8028 | 0.8051 | 0.9763 |
-| 0.0504 | 4.9852 | 1260 | 0.0908 | 0.815 | 0.8050 | 0.8100 | 0.9767 |
+| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| No log | 1.0 | 231 | 0.0805 | 0.7890 | 0.8039 | 0.7964 | 0.9755 |
+| No log | 2.0 | 462 | 0.0616 | 0.8439 | 0.8346 | 0.8392 | 0.9811 |
 
 
 ### Framework versions
 
 - Transformers 4.44.2
-- Pytorch 2.4.
+- Pytorch 2.4.1+cu121
 - Datasets 2.21.0
 - Tokenizers 0.19.1
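The hyperparameters recorded above map almost one-to-one onto `transformers.TrainingArguments`. A minimal sketch of how this configuration could be reconstructed (the `output_dir` and the per-epoch evaluation schedule are assumptions, not part of the card):

```python
from transformers import TrainingArguments

# Sketch of the configuration listed in the card (Transformers 4.44.2).
args = TrainingArguments(
    output_dir="roberta-base-finetuned",  # hypothetical output path
    learning_rate=5e-05,
    per_device_train_batch_size=32,
    per_device_eval_batch_size=32,
    seed=42,
    lr_scheduler_type="linear",
    warmup_ratio=0.1,
    num_train_epochs=2,
    fp16=True,              # "Native AMP" mixed-precision training
    eval_strategy="epoch",  # assumption: one results row per epoch
)
```

No optimizer argument is needed: the default AdamW already uses betas=(0.9, 0.999) and epsilon=1e-08, which is what the card's optimizer line describes.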
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:0f75650dd2452bfca47d17cf7e83a3d70c403f6b05eba17afe9a9a46b03de411
 size 496302532
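The `model.safetensors` entry changes only its Git LFS pointer: `oid` is the SHA-256 of the new weight blob and `size` its byte count. A downloaded copy can therefore be verified against this commit; a minimal sketch, assuming the file sits in the working directory:

```python
import hashlib
import os

# Expected values from the LFS pointer in this commit.
EXPECTED_SHA256 = "0f75650dd2452bfca47d17cf7e83a3d70c403f6b05eba17afe9a9a46b03de411"
EXPECTED_SIZE = 496302532

h = hashlib.sha256()
with open("model.safetensors", "rb") as f:  # assumed local path
    for chunk in iter(lambda: f.read(1 << 20), b""):  # hash in 1 MiB chunks
        h.update(chunk)

print("sha256 matches:", h.hexdigest() == EXPECTED_SHA256)
print("size matches:", os.path.getsize("model.safetensors") == EXPECTED_SIZE)
```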
runs/Sep05_12-17-25_83295d15965e/events.out.tfevents.1725538646.83295d15965e.5325.6 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:5fd54d2493d5eb313ae9ee04a53fa53df45c3ef4410321c75d341572d8a134e9
+size 7060
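The updated tfevents file holds the TensorBoard log for this run. With the `runs/` directory downloaded locally, the scalar curves behind the results table can be read back through TensorBoard's event reader; a sketch, where the local path and the exact tag names (they vary by Trainer version) are assumptions:

```python
from tensorboard.backend.event_processing.event_accumulator import EventAccumulator

# Point the reader at the downloaded run directory (assumed local path).
acc = EventAccumulator("runs/Sep05_12-17-25_83295d15965e")
acc.Reload()

print(acc.Tags()["scalars"])            # list the scalar tags that were logged
for event in acc.Scalars("eval/loss"):  # tag name is an assumption
    print(event.step, event.value)
```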