End of training

Files changed (6) hide show

README.md CHANGED Viewed

@@ -1,22 +1,21 @@
 ---
 license: mit
 tags:
 - generated_from_trainer
-base_model: gpt2
 model-index:
-- name: gpt2out
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/super-duper/huggingface/runs/d28072v4)
-# gpt2out
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.6611
 ## Model description
@@ -44,20 +43,25 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 3
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 125  | 4.5357          |
-| No log        | 2.0   | 250  | 3.8487          |
-| No log        | 3.0   | 375  | 3.6611          |
 ### Framework versions
-- Transformers 4.42.0.dev0
-- Pytorch 2.2.1+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1

 ---
 license: mit
+base_model: gpt2
 tags:
 - generated_from_trainer
 model-index:
+- name: gpt2coder-8epochs
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# gpt2coder-8epochs
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.6054
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 8
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 125  | 4.7927          |
+| No log        | 2.0   | 250  | 3.7089          |
+| No log        | 3.0   | 375  | 3.2773          |
+| 4.5477        | 4.0   | 500  | 3.0126          |
+| 4.5477        | 5.0   | 625  | 2.8363          |
+| 4.5477        | 6.0   | 750  | 2.7137          |
+| 4.5477        | 7.0   | 875  | 2.6372          |
+| 2.8689        | 8.0   | 1000 | 2.6054          |
 ### Framework versions
+- Transformers 4.41.1
+- Pytorch 2.1.2
 - Datasets 2.19.1
 - Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -33,7 +33,7 @@
     }
   },
   "torch_dtype": "float32",
-  "transformers_version": "4.42.0.dev0",
   "use_cache": true,
   "vocab_size": 50257
 }

     }
   },
   "torch_dtype": "float32",
+  "transformers_version": "4.41.1",
   "use_cache": true,
   "vocab_size": 50257
 }

generation_config.json CHANGED Viewed

@@ -2,5 +2,5 @@
   "_from_model_config": true,
   "bos_token_id": 50256,
   "eos_token_id": 50256,
-  "transformers_version": "4.42.0.dev0"
 }

   "_from_model_config": true,
   "bos_token_id": 50256,
   "eos_token_id": 50256,
+  "transformers_version": "4.41.1"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e63e4bede592863337e5fe4db753a9551c101fc324ba5e5d6ad7b8b4d76e4704
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:9e7646390bf9e93424f606063f889306b319ffb5c6e3534cf9534aceb74f492e
 size 497774208

runs/Jun02_06-40-28_3e2a6de4d5c1/events.out.tfevents.1717310435.3e2a6de4d5c1.35.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5ec8483edcb4319de689feaa8d7d41905f22376a682abfa9338be881480a70e5
+size 7967

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:58db82922c455fabff718887fb57d9b49b2a695774339c56ce1f6265eb982047
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:5980876d519391750dedffd367b98344d37b11d489c07832e40f84b6c89dbfbc
 size 5112