End of training

Files changed (13)

README.md CHANGED

@@ -1,6 +1,7 @@
 ---
-license: apache-2.0
-base_model: distilbert/distilroberta-base
+library_name: transformers
+license: mit
+base_model: vinai/bertweet-base
 tags:
 - generated_from_trainer
 model-index:
@@ -13,9 +14,9 @@ should probably proofread and complete it, then remove this comment. -->
 # my_awesome_eli5_mlm_model
-This model is a fine-tuned version of [distilbert/distilroberta-base](https://huggingface.co/distilbert/distilroberta-base) on the None dataset.
+This model is a fine-tuned version of [vinai/bertweet-base](https://huggingface.co/vinai/bertweet-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.8569
+- Loss: 3.9677
 ## Model description
@@ -40,20 +41,20 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
+- num_epochs: 3.0
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 1    | 3.6705          |
-| No log        | 2.0   | 2    | 2.4108          |
-| No log        | 3.0   | 3    | 3.3469          |
+| No log        | 1.0   | 2    | 4.1429          |
+| No log        | 2.0   | 4    | 4.6141          |
+| No log        | 3.0   | 6    | 5.0930          |
 ### Framework versions
-- Transformers 4.33.0
+- Transformers 4.45.1
 - Pytorch 2.1.0+cu121
-- Datasets 2.14.5
-- Tokenizers 0.13.3
+- Datasets 3.0.1
+- Tokenizers 0.20.0
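
Reviewer note: the evaluation loss in the model card is a cross-entropy, so exponentiating it gives a rough (pseudo-)perplexity that is easier to read. The sketch below assumes the loss is a mean cross-entropy in nats over masked tokens, which this diff does not state. The helper name `loss_to_perplexity` is hypothetical. The two base models use different tokenizers and vocabularies, so the two numbers are not directly comparable.

```python
import math


def loss_to_perplexity(loss: float) -> float:
    """Convert a mean cross-entropy loss (assumed to be in nats) to perplexity."""
    return math.exp(loss)


# Evaluation losses copied from the old and new README.md in this commit.
eval_losses = {
    "distilbert/distilroberta-base": 3.8569,
    "vinai/bertweet-base": 3.9677,
}

perplexities = {name: loss_to_perplexity(loss) for name, loss in eval_losses.items()}

for name, ppl in perplexities.items():
    print(f"{name}: eval loss {eval_losses[name]:.4f} -> perplexity ~ {ppl:.1f}")
```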

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "distilbert/distilroberta-base",
+  "_name_or_path": "vinai/bertweet-base",
   "architectures": [
     "RobertaForMaskedLM"
   ],
@@ -7,21 +7,23 @@
   "bos_token_id": 0,
   "classifier_dropout": null,
   "eos_token_id": 2,
+  "gradient_checkpointing": false,
   "hidden_act": "gelu",
   "hidden_dropout_prob": 0.1,
   "hidden_size": 768,
   "initializer_range": 0.02,
   "intermediate_size": 3072,
   "layer_norm_eps": 1e-05,
-  "max_position_embeddings": 514,
+  "max_position_embeddings": 130,
   "model_type": "roberta",
   "num_attention_heads": 12,
-  "num_hidden_layers": 6,
+  "num_hidden_layers": 12,
   "pad_token_id": 1,
   "position_embedding_type": "absolute",
+  "tokenizer_class": "BertweetTokenizer",
   "torch_dtype": "float32",
-  "transformers_version": "4.33.0",
+  "transformers_version": "4.45.1",
   "type_vocab_size": 1,
   "use_cache": true,
-  "vocab_size": 50265
+  "vocab_size": 64001
 }
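
Reviewer note: the config change is easier to audit as a key-by-key comparison. The sketch below copies only the fields that differ between the two sides of this diff, plus `pad_token_id`. The helpers `diff_configs` and `usable_positions` are hypothetical names. `usable_positions` assumes the RoBERTa convention that position ids start at `pad_token_id + 1`, so the longest usable input is `max_position_embeddings - pad_token_id - 1` tokens.

```python
ABSENT = "<absent>"  # sentinel for keys present on only one side

# Only the fields that differ in this commit, plus pad_token_id.
old_config = {
    "_name_or_path": "distilbert/distilroberta-base",
    "max_position_embeddings": 514,
    "num_hidden_layers": 6,
    "pad_token_id": 1,
    "transformers_version": "4.33.0",
    "vocab_size": 50265,
}
new_config = {
    "_name_or_path": "vinai/bertweet-base",
    "gradient_checkpointing": False,
    "max_position_embeddings": 130,
    "num_hidden_layers": 12,
    "pad_token_id": 1,
    "tokenizer_class": "BertweetTokenizer",
    "transformers_version": "4.45.1",
    "vocab_size": 64001,
}


def diff_configs(old: dict, new: dict) -> dict:
    """Return {key: (old_value, new_value)} for every key whose value differs."""
    changes = {}
    for key in sorted(old.keys() | new.keys()):
        before, after = old.get(key, ABSENT), new.get(key, ABSENT)
        if before != after:
            changes[key] = (before, after)
    return changes


def usable_positions(config: dict) -> int:
    """Longest input length, assuming position ids start at pad_token_id + 1."""
    return config["max_position_embeddings"] - config["pad_token_id"] - 1


changes = diff_configs(old_config, new_config)
for key, (before, after) in changes.items():
    print(f"{key}: {before!r} -> {after!r}")

print("usable positions:", usable_positions(old_config), "->", usable_positions(new_config))
```

Under that assumption the maximum input length drops from 512 to 128 tokens, so longer inputs would need to be truncated or chunked before being fed to the new model.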

model.safetensors ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:9f7ca76ca708e9d22ff69d948122e6d804b3c32d54b81bde16de6312c1fc8780
+size 539886236
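
Reviewer note: the three added lines are a Git LFS pointer, not the weights themselves. The sketch below parses such a pointer and derives a rough parameter count from the file size. The helper name `parse_lfs_pointer` is hypothetical. The estimate assumes every parameter is stored as 4-byte float32 (consistent with `"torch_dtype": "float32"` in config.json) and ignores the safetensors header, so it slightly overestimates.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])  # size is the object size in bytes
    return fields


# Pointer contents copied from the model.safetensors entry in this commit.
pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:9f7ca76ca708e9d22ff69d948122e6d804b3c32d54b81bde16de6312c1fc8780
size 539886236
"""

pointer = parse_lfs_pointer(pointer_text)

# Assumption: 4 bytes per parameter (float32); the file header is ignored.
approx_params = pointer["size"] // 4

print(f"oid: {pointer['oid']}")
print(f"size: {pointer['size'] / 1e6:.1f} MB, roughly {approx_params / 1e6:.0f}M parameters")
```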

pytorch_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4d74624e222fbcf5831e1b34049eb87f8b2999a83624a0fd5c7c6e1d2b45d234
+oid sha256:1409c3891cb4bc953e07f89d5cd6094011810d7288b1f6406b6edec9340a6399
 size 328715954

runs/Sep30_02-09-34_uc2n265.localdomain/events.out.tfevents.1727654975.uc2n265.localdomain.792160.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:42980c4fac57b1a05c2059d0388df3786406aeb68f72b4b2e33aa5a76349c76f
+size 4227

runs/Sep30_03-10-57_uc2n265.localdomain/events.out.tfevents.1727658658.uc2n265.localdomain.792160.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:ac4ee619bc485eac2df4c65ef1a28038b698b0ab1a850288a5b0b615054f330a
+size 4158

runs/Sep30_03-12-33_uc2n265.localdomain/events.out.tfevents.1727658760.uc2n265.localdomain.792160.2 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:a0a921ff5338e343c23f07530fa202f8c40c5fe9d0568a7d7c9f108324ef4acf
+size 4158

runs/Sep30_03-15-52_uc2n265.localdomain/events.out.tfevents.1727658955.uc2n265.localdomain.839845.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:ea1896098971b61ac1c0c366bbc59d1ef412a9bce03b118ccf8f324b79628cb2
+size 5304

runs/Sep30_03-46-52_uc2n265.localdomain/events.out.tfevents.1727660812.uc2n265.localdomain.842046.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:83a21bb2fa9614a9953f840df9c05fdc818a41b3fae4ed84c05cddd620f3728c
+size 5304

runs/Sep30_03-46-52_uc2n265.localdomain/events.out.tfevents.1727661254.uc2n265.localdomain.842046.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:7bda9dc04fd6626da2e7dbccfb94644dba4af03a92b50eebfe4f6d76ae9a20c0
+size 354

runs/Sep30_03-55-14_uc2n265.localdomain/events.out.tfevents.1727661315.uc2n265.localdomain.842545.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:2f46de6613fab87ff1fb9390cbd09c9c865fe7030b52356053256606d57a6a05
+size 5304

runs/Sep30_04-15-59_uc2n265.localdomain/events.out.tfevents.1727662560.uc2n265.localdomain.843776.0 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:416f135ef17ed1460d94316625add62088098ba123e85db44269662cabbc0845
+size 5304

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4962340522f916df6a0c881f7433520c98f18b60718a9c20899b0712a38d6b2c
-size 4536
+oid sha256:cc6717532bceccdd93f9408262c994414f0679462a2899dc4c4de5353df41836
+size 5176