mehdie/fine_tuned_mBERT

Files changed (7)

README.md +14 -13
model.safetensors +1 -1
runs/May15_12-51-29_yoga/events.out.tfevents.1715770289.yoga.8932.0 +3 -0
runs/May15_12-51-29_yoga/events.out.tfevents.1715770343.yoga.8932.1 +3 -0
runs/May15_12-52-51_yoga/events.out.tfevents.1715770371.yoga.9258.0 +3 -0
runs/May15_12-52-51_yoga/events.out.tfevents.1715770433.yoga.9258.1 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -19,11 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-bert/bert-base-multilingual-cased](https://huggingface.co/google-bert/bert-base-multilingual-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1693
-- F1: 0.7857
-- F5: 0.8078
-- Precision: 0.7333
-- Recall: 0.8462
 ## Model description
@@ -49,20 +49,21 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.2
-- num_epochs: 7
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     | F5     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:---------:|:------:|
-| No log        | 1.0   | 30   | 0.2580          | 0.0    | 0.0    | 0.0       | 0.0    |
-| No log        | 2.0   | 60   | 0.2658          | 0.08   | 0.0592 | 1.0       | 0.0417 |
-| No log        | 3.0   | 90   | 0.1601          | 0.7273 | 0.7644 | 0.6452    | 0.8333 |
-| No log        | 4.0   | 120  | 0.1913          | 0.5946 | 0.5340 | 0.8462    | 0.4583 |
-| No log        | 5.0   | 150  | 0.2591          | 0.6269 | 0.7030 | 0.4884    | 0.875  |
-| No log        | 6.0   | 180  | 0.1832          | 0.625  | 0.625  | 0.625     | 0.625  |
-| No log        | 7.0   | 210  | 0.2513          | 0.6383 | 0.6332 | 0.6522    | 0.625  |
 ### Framework versions

 This model is a fine-tuned version of [google-bert/bert-base-multilingual-cased](https://huggingface.co/google-bert/bert-base-multilingual-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1614
+- F1: 0.7869
+- F5: 0.8020
+- Precision: 0.75
+- Recall: 0.8276
 ## Model description
 - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.2
+- num_epochs: 8
 - mixed_precision_training: Native AMP
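The hyperparameters above fix the shape of the learning-rate schedule even though the peak learning rate is not visible in this hunk. The sketch below assumes 30 optimizer steps per epoch (read off the Step column of the results table) and that warmup steps are obtained by rounding `total_steps * warmup_ratio` up; the helper name `linear_warmup_multiplier` is hypothetical and only illustrates the usual linear warmup followed by linear decay.

```python
import math


def linear_warmup_multiplier(step: int, total_steps: int, warmup_steps: int) -> float:
    """Factor applied to the peak learning rate at a given optimizer step."""
    if step < warmup_steps:
        # Linear ramp from 0 up to 1 during warmup.
        return step / max(1, warmup_steps)
    # Linear decay from 1 down to 0 over the remaining steps.
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


steps_per_epoch = 30  # assumption: taken from the Step column (30, 60, ..., 240)
num_epochs = 8        # new value in this commit (previously 7)
warmup_ratio = 0.2

total_steps = steps_per_epoch * num_epochs
warmup_steps = math.ceil(total_steps * warmup_ratio)  # assumption: rounded up

for step in (0, 24, warmup_steps, 144, total_steps):
    print(step, linear_warmup_multiplier(step, total_steps, warmup_steps))
```

Under these assumptions, raising `num_epochs` from 7 to 8 also lengthens the warmup phase, since the warmup is defined as a ratio of the total step count rather than a fixed number of steps.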
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | F1     | F5     | Precision | Recall |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:---------:|:------:|
+| No log        | 1.0   | 30   | 0.2615          | 0.0    | 0.0    | 0.0       | 0.0    |
+| No log        | 2.0   | 60   | 0.1838          | 0.5333 | 0.4626 | 0.8889    | 0.3810 |
+| No log        | 3.0   | 90   | 0.2338          | 0.3077 | 0.2491 | 0.8       | 0.1905 |
+| No log        | 4.0   | 120  | 0.2003          | 0.6667 | 0.6268 | 0.8       | 0.5714 |
+| No log        | 5.0   | 150  | 0.2643          | 0.5    | 0.4906 | 0.5263    | 0.4762 |
+| No log        | 6.0   | 180  | 0.2211          | 0.6486 | 0.6168 | 0.75      | 0.5714 |
+| No log        | 7.0   | 210  | 0.2233          | 0.6    | 0.6391 | 0.5172    | 0.7143 |
+| No log        | 8.0   | 240  | 0.3328          | 0.5    | 0.5647 | 0.3846    | 0.7143 |
 ### Framework versions
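The README does not define the "F5" column. As a sanity check, the sketch below recomputes F-beta scores from the reported precision and recall. The value `beta=1.5` is an inference: it reproduces the reported F5 figures to within rounding, whereas `beta=5` and `beta=0.5` do not. It is not something the model card states.

```python
def f_beta(precision: float, recall: float, beta: float) -> float:
    """Weighted harmonic mean of precision and recall (beta > 1 favors recall)."""
    denom = beta**2 * precision + recall
    if denom == 0:
        return 0.0
    return (1 + beta**2) * precision * recall / denom


# Evaluation-set values reported in the updated README.
precision, recall = 0.75, 0.8276

f1 = f_beta(precision, recall, beta=1.0)
f15 = f_beta(precision, recall, beta=1.5)  # assumption: beta inferred from the numbers

print(f"F1       = {f1:.4f}  (reported 0.7869)")
print(f"F(b=1.5) = {f15:.4f}  (reported F5 0.8020)")
```

Because the inputs are themselves rounded to four decimals, agreement is only expected to about three decimal places.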

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f1a81791380147aa76429d7f2b7753dfff496489e0bdef800cddd7f331426f1d
 size 711443456

 version https://git-lfs.github.com/spec/v1
+oid sha256:a875f15504f2b55744513b26a428a92fe9eb8c23e916dedcef0d6470d35ed041
 size 711443456
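The entries above are Git LFS pointer files rather than the weights themselves: the `oid` is the SHA-256 of the real file and `size` is its length in bytes. Here the size is unchanged while the hash differs, which is what one would expect from retrained weights with the same architecture. A minimal sketch for parsing such a pointer and checking a downloaded file against it follows; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and hash the pointer records."""
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as f:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer for model.safetensors, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:a875f15504f2b55744513b26a428a92fe9eb8c23e916dedcef0d6470d35ed041\n"
    "size 711443456\n"
)
print(pointer["algo"], pointer["size"])
```

After downloading the weights, `matches_pointer(Path("model.safetensors"), pointer)` would confirm that the local copy corresponds to this commit; the local path is an assumption.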

runs/May15_12-51-29_yoga/events.out.tfevents.1715770289.yoga.8932.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:b4539e6d653dc792b71664585737f98f40b85cede5d6b6cd8d2f71a342279679
+size 8416

runs/May15_12-51-29_yoga/events.out.tfevents.1715770343.yoga.8932.1 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7d29088ffb5374e2c3a85dc13e106a95c16e53b9047445e32ffefaacd092a3ba
+size 554

runs/May15_12-52-51_yoga/events.out.tfevents.1715770371.yoga.9258.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:f2d0e00f69095f709645b2924f4c40cb67e572ddce9a5e141b7ea479e8a7de31
+size 8882

runs/May15_12-52-51_yoga/events.out.tfevents.1715770433.yoga.9258.1 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:3c95c42662e71f8b23bedcecefc7b5ad0a210d8644c64017a5ee07d0e6fd06d4
+size 554

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:feafb12dae99288d27f51f5c2198a84f9202bc396fbcc6e35fb017a7ddd62a1b
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:68f07a9cab6927341587334ca897338c93f50d07ae3a0e403ab4a360ded7d7b1
 size 4920