Training in progress epoch 0

Files changed (5)

README.md CHANGED

@@ -1,5 +1,4 @@
 ---
-license: apache-2.0
 tags:
 - generated_from_keras_callback
 model-index:
@@ -12,10 +11,12 @@ probably proofread and complete it, then remove this comment. -->
 # veb/twitch-bert-base-cased-finetuned
-This model is a fine-tuned version of [bert-base-cased](https://huggingface.co/bert-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 4.4766
-- Validation Loss: 3.6452
 - Epoch: 0
 ## Model description
@@ -35,14 +36,14 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'WarmUp', 'config': {'initial_learning_rate': 2e-05, 'decay_schedule_fn': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': -939, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, '__passive_serialization__': True}, 'warmup_steps': 1000, 'power': 1.0, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
-- training_precision: mixed_float16
 ### Training results
-| Train Loss | Validation Loss | Epoch |
-|:----------:|:---------------:|:-----:|
-| 4.4766     | 3.6452          | 0     |
 ### Framework versions

 ---
 tags:
 - generated_from_keras_callback
 model-index:
 # veb/twitch-bert-base-cased-finetuned
+This model was trained from scratch on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 0.2929
+- Train Sparse Categorical Accuracy: 0.8768
+- Validation Loss: 0.1927
+- Validation Sparse Categorical Accuracy: 0.9483
 - Epoch: 0
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'Adam', 'learning_rate': 5e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
+- training_precision: float32
 ### Training results
+| Train Loss | Train Sparse Categorical Accuracy | Validation Loss | Validation Sparse Categorical Accuracy | Epoch |
+|:----------:|:---------------------------------:|:---------------:|:--------------------------------------:|:-----:|
+| 0.2929     | 0.8768                            | 0.1927          | 0.9483                                 | 0     |
 ### Framework versions
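
The removed optimizer entry records `decay_steps: -939` together with `warmup_steps: 1000`. A minimal sketch of what that could imply, assuming (this is not stated in the diff) that `decay_steps` was derived as total training steps minus warmup steps and that the warmup phase scales the rate as `initial_learning_rate * (step / warmup_steps) ** power`. The helper name `warmup_lr` is hypothetical.

```python
# Values copied from the removed optimizer config in the README diff.
INITIAL_LR = 2e-05
WARMUP_STEPS = 1000
DECAY_STEPS = -939
POWER = 1.0


def warmup_lr(step, initial_lr=INITIAL_LR, warmup_steps=WARMUP_STEPS, power=POWER):
    """Learning rate at `step` while still inside the warmup phase (assumed formula)."""
    return initial_lr * (step / warmup_steps) ** power


# Assumption: decay_steps = total_steps - warmup_steps, so solve for total_steps.
total_steps = DECAY_STEPS + WARMUP_STEPS

# Under that assumption the run ends long before warmup completes,
# so the highest rate reached is the warmup value at the last step.
peak_lr = warmup_lr(total_steps)

print(f"total steps (assumed): {total_steps}")
print(f"peak learning rate reached: {peak_lr:.3e}")
```

If the assumption holds, the earlier run never left warmup and never reached the nominal 2e-05 rate, whereas the new entry uses a constant Adam rate of 5e-05.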

config.json CHANGED

@@ -1,7 +1,7 @@
 {
-  "_name_or_path": "bert-base-cased",
   "architectures": [
-    "BertForMaskedLM"
   ],
   "attention_probs_dropout_prob": 0.1,
   "classifier_dropout": null,

 {
+  "_name_or_path": "veb/twitch-bert-base-cased-finetuned",
   "architectures": [
+    "BertForSequenceClassification"
   ],
   "attention_probs_dropout_prob": 0.1,
   "classifier_dropout": null,

tf_model.h5 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7e8d11ea5f5a1e84365d578094f53fee37f44af79e8ec91d568ecf09e6e49285
-size 524302448

 version https://git-lfs.github.com/spec/v1
+oid sha256:b2f508293dc744f769dc04c1f63cda14895450cf69850a06dc85ff12f489c232
+size 433518320
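
The `tf_model.h5` change only touches a Git LFS pointer, so the weight difference shows up as a new `oid` and `size`. A small sketch that parses the two pointers from this diff and reports the size change; the helper name `parse_lfs_pointer` is hypothetical and not part of any Git LFS tooling.

```python
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:7e8d11ea5f5a1e84365d578094f53fee37f44af79e8ec91d568ecf09e6e49285
size 524302448
"""

NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:b2f508293dc744f769dc04c1f63cda14895450cf69850a06dc85ff12f489c232
size 433518320
"""


def parse_lfs_pointer(text):
    """Split a pointer file into its key/value lines and type the fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    hash_algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": hash_algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
delta = old["size"] - new["size"]

print(f"old size: {old['size']} bytes")
print(f"new size: {new['size']} bytes")
print(f"reduction: {delta} bytes ({delta / old['size']:.1%})")
```

The pointer alone does not say why the file shrank; the switch from `BertForMaskedLM` to `BertForSequenceClassification` in `config.json` is one plausible contributor.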

tokenizer.json CHANGED

@@ -1,7 +1,21 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
   "added_tokens": [
     {
       "id": 0,

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 512,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": {
+      "Fixed": 512
+    },
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 0,
+    "pad_type_id": 0,
+    "pad_token": "[PAD]"
+  },
   "added_tokens": [
     {
       "id": 0,
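
The added `truncation` and `padding` blocks mean every encoded sequence comes out exactly 512 ids long: longer inputs are cut on the right, shorter ones are filled on the right with `pad_id` 0. A simplified pure-Python sketch of that behavior follows. It is not the `tokenizers` implementation (which also handles special tokens, sequence pairs, and stride), and `truncate_and_pad` and the sample ids are illustrative.

```python
def truncate_and_pad(token_ids, max_length=512, pad_id=0):
    """Right-truncate to max_length, then right-pad to exactly max_length."""
    # Truncation, direction "Right": keep the first max_length ids.
    ids = list(token_ids)[:max_length]
    attention_mask = [1] * len(ids)

    # Padding, strategy "Fixed": 512, direction "Right": append pad_id.
    n_pad = max_length - len(ids)
    ids += [pad_id] * n_pad
    attention_mask += [0] * n_pad
    return ids, attention_mask


# A short input (illustrative ids) is padded up to 512.
short_ids, short_mask = truncate_and_pad([101, 8667, 102])

# A 600-id input is cut down to 512 with no padding.
long_ids, long_mask = truncate_and_pad(range(1, 601))

print(len(short_ids), sum(short_mask))
print(len(long_ids), sum(long_mask))
```

One consequence of fixed-length padding is that short inputs still cost a full 512-position forward pass, which may matter for short chat-style text.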

tokenizer_config.json CHANGED

@@ -1 +1 @@
-{"do_lower_case": false, "unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "tokenize_chinese_chars": true, "strip_accents": null, "model_max_length": 512, "special_tokens_map_file": null, "name_or_path": "bert-base-cased", "tokenizer_class": "BertTokenizer"}

+{"do_lower_case": false, "unk_token": "[UNK]", "sep_token": "[SEP]", "pad_token": "[PAD]", "cls_token": "[CLS]", "mask_token": "[MASK]", "tokenize_chinese_chars": true, "strip_accents": null, "model_max_length": 512, "special_tokens_map_file": null, "name_or_path": "veb/twitch-bert-base-cased-finetuned", "tokenizer_class": "BertTokenizer"}