morten-j committed on Apr 30

Commit ddb52f9 • 1 Parent(s): a9c150a

mehdie/fine_tuned_mBERT

Files changed (22)

README.md +73 -0
config.json +40 -0
model.safetensors +3 -0
runs/Apr30_09-04-55_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460696.a256-a40-06.srv.aau.dk.1177277.0 +3 -0
runs/Apr30_09-04-55_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460729.a256-a40-06.srv.aau.dk.1177277.1 +3 -0
runs/Apr30_09-07-24_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460845.a256-a40-06.srv.aau.dk.1177625.0 +3 -0
runs/Apr30_09-07-24_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460885.a256-a40-06.srv.aau.dk.1177625.1 +3 -0
runs/Apr30_09-08-54_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460935.a256-a40-06.srv.aau.dk.1177944.0 +3 -0
runs/Apr30_09-08-54_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460977.a256-a40-06.srv.aau.dk.1177944.1 +3 -0
runs/Apr30_09-15-57_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461358.a256-a40-06.srv.aau.dk.1178259.0 +3 -0
runs/Apr30_09-15-57_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461400.a256-a40-06.srv.aau.dk.1178259.1 +3 -0
runs/Apr30_09-18-27_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461508.a256-a40-06.srv.aau.dk.1178883.0 +3 -0
runs/Apr30_09-18-27_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461568.a256-a40-06.srv.aau.dk.1178883.1 +3 -0
runs/Apr30_09-26-25_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461986.a256-a40-06.srv.aau.dk.1179240.0 +3 -0
runs/Apr30_09-26-25_a256-a40-06.srv.aau.dk/events.out.tfevents.1714462017.a256-a40-06.srv.aau.dk.1179240.1 +3 -0
runs/Apr30_09-52-25_a256-a40-06.srv.aau.dk/events.out.tfevents.1714463546.a256-a40-06.srv.aau.dk.1179669.0 +3 -0
runs/Apr30_09-52-25_a256-a40-06.srv.aau.dk/events.out.tfevents.1714463584.a256-a40-06.srv.aau.dk.1179669.1 +3 -0
special_tokens_map.json +7 -0
tokenizer.json +0 -0
tokenizer_config.json +55 -0
training_args.bin +3 -0
vocab.txt +0 -0

README.md ADDED

	@@ -0,0 +1,73 @@

+---
+license: apache-2.0
+base_model: google-bert/bert-base-multilingual-cased
+tags:
+- generated_from_trainer
+metrics:
+- f1
+- precision
+- recall
+model-index:
+- name: fine_tuned_mBERT
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# fine_tuned_mBERT
+This model is a fine-tuned version of [google-bert/bert-base-multilingual-cased](https://huggingface.co/google-bert/bert-base-multilingual-cased) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1592
+- F1: 0.7805
+- F5: 0.8186
+- Precision: 0.6957
+- Recall: 0.8889
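The reported F1 can be cross-checked from the precision and recall listed above. The following is a minimal sketch of the standard F-beta formula, not the evaluation code used for this model; the function name `f_beta` is hypothetical. The card does not define what "F5" means, so `beta` is left as a parameter rather than guessed.

```python
def f_beta(precision: float, recall: float, beta: float = 1.0) -> float:
    """Weighted harmonic mean of precision and recall (beta > 1 favours recall)."""
    if precision == 0.0 and recall == 0.0:
        # Common convention when there are no true positives.
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as reported in the model card.
precision, recall = 0.6957, 0.8889
f1 = f_beta(precision, recall, beta=1.0)
print(round(f1, 4))  # 0.7805, matching the reported F1
```

With `beta=1.0` the rounded result agrees with the F1 in the card, which suggests the three figures are at least mutually consistent.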
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 256
+- eval_batch_size: 256
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 8
+- mixed_precision_training: Native AMP
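To make the `lr_scheduler_type: linear` setting concrete, here is a small sketch of a linear decay from `learning_rate` to zero. It assumes zero warmup steps (the card lists none) and a total of 48 optimizer steps (8 epochs at 6 steps each, as shown in the training results table). The function `linear_lr` is a hypothetical helper, not the Trainer's own implementation.

```python
def linear_lr(step: int, total_steps: int, base_lr: float = 5e-5,
              warmup_steps: int = 0) -> float:
    """Learning rate at `step` under linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


TOTAL_STEPS = 48  # assumption: 8 epochs x 6 steps per epoch

# Learning rate at the start of training and at the end of each epoch.
for step in range(0, TOTAL_STEPS + 1, 6):
    print(f"step {step:2d}: lr = {linear_lr(step, TOTAL_STEPS):.2e}")
```

Under these assumptions the rate is halved by the end of epoch 4 and reaches zero at step 48.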
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | F1     | F5     | Precision | Recall |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:---------:|:------:|
+| No log        | 1.0   | 6    | 0.3289          | 0.0    | 0.0    | 0.0       | 0.0    |
+| No log        | 2.0   | 12   | 0.3095          | 0.0    | 0.0    | 0.0       | 0.0    |
+| No log        | 3.0   | 18   | 0.2680          | 0.3636 | 0.2925 | 1.0       | 0.2222 |
+| No log        | 4.0   | 24   | 0.2876          | 0.4    | 0.3424 | 0.7143    | 0.2778 |
+| No log        | 5.0   | 30   | 0.2680          | 0.6087 | 0.6638 | 0.5       | 0.7778 |
+| No log        | 6.0   | 36   | 0.2565          | 0.4615 | 0.4024 | 0.75      | 0.3333 |
+| No log        | 7.0   | 42   | 0.2320          | 0.6341 | 0.6651 | 0.5652    | 0.7222 |
+| No log        | 8.0   | 48   | 0.2251          | 0.6    | 0.5574 | 0.75      | 0.5    |
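A quick way to read the table is to pick the best epoch under different criteria. The sketch below transcribes the eight rows and selects the epoch with the highest F1 and the one with the lowest validation loss; the `Row` type and variable names are illustrative only. None of the rows reproduces the headline F1 of 0.7805 reported earlier in the card, so those figures presumably come from an evaluation that the card does not describe.

```python
from collections import namedtuple

Row = namedtuple("Row", "epoch val_loss f1 f5 precision recall")

# Values transcribed from the training results table.
rows = [
    Row(1, 0.3289, 0.0,    0.0,    0.0,    0.0),
    Row(2, 0.3095, 0.0,    0.0,    0.0,    0.0),
    Row(3, 0.2680, 0.3636, 0.2925, 1.0,    0.2222),
    Row(4, 0.2876, 0.4,    0.3424, 0.7143, 0.2778),
    Row(5, 0.2680, 0.6087, 0.6638, 0.5,    0.7778),
    Row(6, 0.2565, 0.4615, 0.4024, 0.75,   0.3333),
    Row(7, 0.2320, 0.6341, 0.6651, 0.5652, 0.7222),
    Row(8, 0.2251, 0.6,    0.5574, 0.75,   0.5),
]

best_f1 = max(rows, key=lambda r: r.f1)
best_loss = min(rows, key=lambda r: r.val_loss)

print(f"best F1:   epoch {best_f1.epoch} (F1 = {best_f1.f1})")
print(f"best loss: epoch {best_loss.epoch} (loss = {best_loss.val_loss})")
```

The two criteria disagree here (epoch 7 by F1, epoch 8 by validation loss), which is worth keeping in mind when choosing a checkpoint on such a small evaluation set.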
+### Framework versions
+- Transformers 4.38.2
+- Pytorch 2.3.0a0+ebedce2
+- Datasets 2.17.1
+- Tokenizers 0.15.2

config.json ADDED

	@@ -0,0 +1,40 @@

+{
+  "_name_or_path": "google-bert/bert-base-multilingual-cased",
+  "architectures": [
+    "BertForSequenceClassification"
+  ],
+  "attention_probs_dropout_prob": 0.1,
+  "classifier_dropout": null,
+  "directionality": "bidi",
+  "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.1,
+  "hidden_size": 768,
+  "id2label": {
+    "0": "NO_MATCH",
+    "1": "MATCH"
+  },
+  "initializer_range": 0.02,
+  "intermediate_size": 3072,
+  "label2id": {
+    "MATCH": 1,
+    "NO_MATCH": 0
+  },
+  "layer_norm_eps": 1e-12,
+  "max_position_embeddings": 512,
+  "model_type": "bert",
+  "num_attention_heads": 12,
+  "num_hidden_layers": 12,
+  "pad_token_id": 0,
+  "pooler_fc_size": 768,
+  "pooler_num_attention_heads": 12,
+  "pooler_num_fc_layers": 3,
+  "pooler_size_per_head": 128,
+  "pooler_type": "first_token_transform",
+  "position_embedding_type": "absolute",
+  "problem_type": "single_label_classification",
+  "torch_dtype": "float32",
+  "transformers_version": "4.38.2",
+  "type_vocab_size": 2,
+  "use_cache": true,
+  "vocab_size": 119547
+}
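The `id2label` mapping and `problem_type: single_label_classification` determine how the two output logits are interpreted. The sketch below shows that decoding step in plain Python, assuming a softmax followed by an argmax; the logits are made up for illustration and the helper names `softmax` and `decode` are hypothetical.

```python
import json
import math

# Only the relevant fragment of config.json is reproduced here.
CONFIG_FRAGMENT = '{"id2label": {"0": "NO_MATCH", "1": "MATCH"}}'

# JSON object keys are strings, so convert them back to integer class ids.
id2label = {int(k): v for k, v in json.loads(CONFIG_FRAGMENT)["id2label"].items()}


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def decode(logits):
    """Return (label, probability) for the highest-scoring class."""
    probs = softmax(logits)
    idx = max(range(len(probs)), key=probs.__getitem__)
    return id2label[idx], probs[idx]


label, prob = decode([-1.2, 2.3])  # made-up logits, not model output
print(label, round(prob, 3))
```

In practice the logits would come from the `BertForSequenceClassification` head named in `architectures`; only the decoding is shown here.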

model.safetensors ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:177b1e40b192e786438df4797fe1bbb220b0d701d0effb7bcde1308040a7557b
+size 711443456
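The three added lines are a Git LFS pointer, not the weights themselves. As a rough illustration, the sketch below parses the pointer and turns the `size` field into more readable numbers. The parameter estimate assumes every tensor is stored as 4-byte float32 (consistent with `torch_dtype` in config.json) and ignores the small safetensors header, so it is only an approximation. `parse_lfs_pointer` is a hypothetical helper.

```python
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:177b1e40b192e786438df4797fe1bbb220b0d701d0effb7bcde1308040a7557b
size 711443456
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into its version, hash algorithm, digest and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


info = parse_lfs_pointer(POINTER)
size_mib = info["size"] / 2**20
approx_params = info["size"] // 4  # assumption: 4 bytes per float32 parameter

print(f"{size_mib:.1f} MiB, roughly {approx_params / 1e6:.0f}M parameters")
```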

runs/Apr30_09-04-55_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460696.a256-a40-06.srv.aau.dk.1177277.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:dbe6e1b7f82749974e4179e80c4d6a15f30c6ee8026b0f158841bd4444b08919
+size 8356
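The TensorBoard event file names encode when and where each run was started. The sketch below splits one of the names from this commit into its parts, assuming the usual `events.out.tfevents.<unix time>.<hostname>.<pid>.<counter>` layout; the regular expression and the function `parse_event_name` are illustrative, not part of any library.

```python
import re
from datetime import datetime, timezone

NAME = "events.out.tfevents.1714460696.a256-a40-06.srv.aau.dk.1177277.0"

# Assumed layout: events.out.tfevents.<unix time>.<hostname>.<pid>.<counter>
PATTERN = re.compile(r"events\.out\.tfevents\.(\d+)\.(.+)\.(\d+)\.(\d+)$")


def parse_event_name(name: str) -> dict:
    """Extract timestamp, hostname, process id and counter from an event file name."""
    match = PATTERN.match(name)
    if match is None:
        raise ValueError(f"unexpected event file name: {name}")
    ts, host, pid, counter = match.groups()
    return {
        "started_utc": datetime.fromtimestamp(int(ts), tz=timezone.utc),
        "host": host,
        "pid": int(pid),
        "counter": int(counter),
    }


info = parse_event_name(NAME)
print(info["started_utc"].isoformat())  # 2024-04-30T07:04:56+00:00
print(info["host"], info["pid"], info["counter"])
```

The UTC time is two hours behind the `Apr30_09-04-55` directory name, which would be consistent with the host clock running at UTC+2.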

runs/Apr30_09-04-55_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460729.a256-a40-06.srv.aau.dk.1177277.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9dc6e03813857ef156ff17ab9aa4424e0d4438796d704b0aee7e874e16f85b1e
+size 497

runs/Apr30_09-07-24_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460845.a256-a40-06.srv.aau.dk.1177625.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e0cc47c7b8761eaaf2c398b490d5012d7bd9b7ea08bc1c80d4f2b544c4d07212
+size 9728

runs/Apr30_09-07-24_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460885.a256-a40-06.srv.aau.dk.1177625.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e31f2484d90dfc7de1532a4d42d1a9abe2f59b943a23bc4bee49eb9e55573302
+size 497

runs/Apr30_09-08-54_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460935.a256-a40-06.srv.aau.dk.1177944.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d8c726c6dd28335d439479b324bea1425fb6b49b0aad966771c02cb4ec417a11
+size 9728

runs/Apr30_09-08-54_a256-a40-06.srv.aau.dk/events.out.tfevents.1714460977.a256-a40-06.srv.aau.dk.1177944.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ecd07554c34a2029be5ebf556d09406f15ca650338bf4f71dab2faed2007b6d3
+size 497

runs/Apr30_09-15-57_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461358.a256-a40-06.srv.aau.dk.1178259.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:19df10aad91e42b56410f069b1e6bdabc4f1366ca2db24e0d016f71632cad809
+size 9728

runs/Apr30_09-15-57_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461400.a256-a40-06.srv.aau.dk.1178259.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a381e0d682669ed275241121c0c594d3cb6566e377ba1c0616248180d0b2a531
+size 497

runs/Apr30_09-18-27_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461508.a256-a40-06.srv.aau.dk.1178883.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:73e8a037a36cf8059866ea20dd55441897575f2e62f7219dc8e278ad84c71282
+size 12013

runs/Apr30_09-18-27_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461568.a256-a40-06.srv.aau.dk.1178883.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:dd00ce344e15ea51de80a19ccf1973af6577b872a1a373522aff73634ad74ce4
+size 497

runs/Apr30_09-26-25_a256-a40-06.srv.aau.dk/events.out.tfevents.1714461986.a256-a40-06.srv.aau.dk.1179240.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:42a7b47a3e9f09aec7b8a924eee48761a36649715ee4fdd3f8d7437b16ff3997
+size 8813

runs/Apr30_09-26-25_a256-a40-06.srv.aau.dk/events.out.tfevents.1714462017.a256-a40-06.srv.aau.dk.1179240.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0636621560a42d24d25f0d36ed60a71a0c4564c60d823621a649516c6a8773b0
+size 497

runs/Apr30_09-52-25_a256-a40-06.srv.aau.dk/events.out.tfevents.1714463546.a256-a40-06.srv.aau.dk.1179669.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:6157fc7bf61e7e378e04367952dad5adae2b695e4228c23fe2e142ef1aaab1d2
+size 8813

runs/Apr30_09-52-25_a256-a40-06.srv.aau.dk/events.out.tfevents.1714463584.a256-a40-06.srv.aau.dk.1179669.1 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:21e51caae3efcd99817e4f0e45b12677686c6fa2443b10c671e164c16a94a6aa
+size 497

special_tokens_map.json ADDED

	@@ -0,0 +1,7 @@

+{
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
+}

tokenizer.json ADDED

The diff for this file is too large to render.

tokenizer_config.json ADDED

	@@ -0,0 +1,55 @@

+{
+  "added_tokens_decoder": {
+    "0": {
+      "content": "[PAD]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "100": {
+      "content": "[UNK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "101": {
+      "content": "[CLS]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "102": {
+      "content": "[SEP]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    },
+    "103": {
+      "content": "[MASK]",
+      "lstrip": false,
+      "normalized": false,
+      "rstrip": false,
+      "single_word": false,
+      "special": true
+    }
+  },
+  "clean_up_tokenization_spaces": true,
+  "cls_token": "[CLS]",
+  "do_lower_case": false,
+  "mask_token": "[MASK]",
+  "model_max_length": 512,
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "strip_accents": null,
+  "tokenize_chinese_chars": true,
+  "tokenizer_class": "BertTokenizer",
+  "unk_token": "[UNK]"
+}
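The special-token ids in `added_tokens_decoder` determine how inputs are framed before they reach the model. The sketch below lays out a BERT-style sequence pair (`[CLS] A [SEP] B [SEP]`) with its segment ids, using ids 101 and 102 from the config above. The word-piece ids are placeholders, and whether this model was trained on single sequences or pairs is not stated in the card, so treat the pair layout as an assumption. `build_pair` is a hypothetical helper, not the tokenizer's API.

```python
CLS_ID, SEP_ID = 101, 102  # ids of [CLS] and [SEP] in added_tokens_decoder
MODEL_MAX_LENGTH = 512     # model_max_length in tokenizer_config.json


def build_pair(ids_a, ids_b):
    """Frame two already-tokenized segments as [CLS] A [SEP] B [SEP]."""
    input_ids = [CLS_ID] + ids_a + [SEP_ID] + ids_b + [SEP_ID]
    # Segment 0 covers [CLS], A and the first [SEP]; segment 1 covers B and the final [SEP].
    token_type_ids = [0] * (len(ids_a) + 2) + [1] * (len(ids_b) + 1)
    if len(input_ids) > MODEL_MAX_LENGTH:
        raise ValueError("sequence pair exceeds model_max_length; truncate first")
    return input_ids, token_type_ids


# Placeholder word-piece ids, chosen only for illustration.
input_ids, token_type_ids = build_pair([1000, 1001], [2000])
print(input_ids)       # [101, 1000, 1001, 102, 2000, 102]
print(token_type_ids)  # [0, 0, 0, 0, 1, 1]
```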

training_args.bin ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:775332cd258f5ef5a9150cb35f6c09aab31c33211b13531005b25a0386937956
+size 4920

vocab.txt ADDED

The diff for this file is too large to render.