End of training

Files changed (5)

README.md +25 -6
model.safetensors +1 -1
runs/Jun06_02-41-03_a62fcfc33d39/events.out.tfevents.1717641676.a62fcfc33d39.34.0 +3 -0
runs/Jun06_03-40-43_a62fcfc33d39/events.out.tfevents.1717645246.a62fcfc33d39.34.1 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -1,8 +1,8 @@
 ---
 license: mit
 tags:
 - generated_from_trainer
-base_model: Aravindan/gpt2out
 model-index:
 - name: gpt2coder-8epochs
   results: []
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Aravindan/gpt2out](https://huggingface.co/Aravindan/gpt2out) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0309
 ## Model description
@@ -43,13 +43,32 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
-- num_epochs: 1
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| 2.2389        | 0.9998 | 1739 | 2.0309          |
 ### Framework versions

 ---
 license: mit
+base_model: Aravindan/gpt2out
 tags:
 - generated_from_trainer
 model-index:
 - name: gpt2coder-8epochs
   results: []
 This model is a fine-tuned version of [Aravindan/gpt2out](https://huggingface.co/Aravindan/gpt2out) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6964
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_ratio: 0.1
+- num_epochs: 20
 ### Training results
+| Training Loss | Epoch   | Step | Validation Loss |
+|:-------------:|:-------:|:----:|:---------------:|
+| No log        | 0.9962  | 132  | 1.5428          |
+| No log        | 2.0     | 265  | 1.4204          |
+| No log        | 2.9962  | 397  | 1.2888          |
+| 1.5725        | 4.0     | 530  | 1.1900          |
+| 1.5725        | 4.9962  | 662  | 1.1045          |
+| 1.5725        | 6.0     | 795  | 1.0314          |
+| 1.5725        | 6.9962  | 927  | 0.9723          |
+| 1.217         | 8.0     | 1060 | 0.9139          |
+| 1.217         | 8.9962  | 1192 | 0.8689          |
+| 1.217         | 10.0    | 1325 | 0.8274          |
+| 1.217         | 10.9962 | 1457 | 0.7910          |
+| 1.0164        | 12.0    | 1590 | 0.7555          |
+| 1.0164        | 12.9962 | 1722 | 0.7266          |
+| 1.0164        | 14.0    | 1855 | 0.7014          |
+| 1.0164        | 14.9962 | 1987 | 0.6777          |
+| 0.8885        | 16.0    | 2120 | 0.6597          |
+| 0.8885        | 16.9962 | 2252 | 0.6440          |
+| 0.8885        | 18.0    | 2385 | 0.6327          |
+| 0.8106        | 18.9962 | 2517 | 0.6239          |
+| 0.8106        | 19.9962 | 2640 | 0.6964          |
 ### Framework versions
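
A note on the results table above: the headline `Loss: 0.6964` matches the last row (step 2640), while the lowest validation loss in the table is at step 2517. The sketch below reads that off the table. The rows are copied from this diff; the names `ROWS`, `best_row`, and `summarize` are hypothetical helpers for illustration and are not part of the training code.

```python
# (epoch, step, validation_loss) rows copied from the README table in this commit.
ROWS = [
    (0.9962, 132, 1.5428),
    (2.0, 265, 1.4204),
    (2.9962, 397, 1.2888),
    (4.0, 530, 1.1900),
    (4.9962, 662, 1.1045),
    (6.0, 795, 1.0314),
    (6.9962, 927, 0.9723),
    (8.0, 1060, 0.9139),
    (8.9962, 1192, 0.8689),
    (10.0, 1325, 0.8274),
    (10.9962, 1457, 0.7910),
    (12.0, 1590, 0.7555),
    (12.9962, 1722, 0.7266),
    (14.0, 1855, 0.7014),
    (14.9962, 1987, 0.6777),
    (16.0, 2120, 0.6597),
    (16.9962, 2252, 0.6440),
    (18.0, 2385, 0.6327),
    (18.9962, 2517, 0.6239),
    (19.9962, 2640, 0.6964),
]


def best_row(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[2])


def summarize(rows):
    """Compare the best evaluation against the final one."""
    best = best_row(rows)
    final = rows[-1]
    return {
        "best_step": best[1],
        "best_loss": best[2],
        "final_step": final[1],
        "final_loss": final[2],
        # Positive gap means the final evaluation is worse than the best one.
        "gap": round(final[2] - best[2], 4),
    }


if __name__ == "__main__":
    print(summarize(ROWS))
```

If the lowest-loss checkpoint is wanted rather than the last one, the `Trainer` option `load_best_model_at_end` is one way to get it. Whether it was set for this run is not visible in the diff.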

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:db5ed9c70628f43009980c4fd88939d66c16dec74f5d068c1f6ff9a974b1e663
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:efeb99fbaf67a15c9a11dfb2fc55a0f2f5b74e9433d0f47526c76f8f4887c22e
 size 497774208

runs/Jun06_02-41-03_a62fcfc33d39/events.out.tfevents.1717641676.a62fcfc33d39.34.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:34d52a6d0ee51396d6cee63e78a44b40d0960be4226e3ff81aa5e65620a52409
+size 5721

runs/Jun06_03-40-43_a62fcfc33d39/events.out.tfevents.1717645246.a62fcfc33d39.34.1 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a7dbaf20a922f78efc3031e2be994643d76b5c144a24d132369e8f0482c4a3e9
+size 11081

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:58c41c320c81744d1d2af38cb685c1228f0782b20aeb38f5c3a9cd0b7042c804
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:9363ca53d937dbdb4298149be39434a15fbc3ad7f0392de9539f3eb2c063d52f
 size 5112
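
The binary files in this commit appear only as Git LFS pointers: three `key value` lines giving the spec version, a `sha256` object id, and the size in bytes. As a minimal sketch, assuming the real file has been downloaded locally, a pointer can be parsed and checked against that file as shown below. `POINTER_TEXT` is the new `model.safetensors` pointer from this diff; `parse_lfs_pointer` and `file_matches_pointer` are hypothetical helper names, not part of any library.

```python
import hashlib

# New model.safetensors pointer, copied from this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:efeb99fbaf67a15c9a11dfb2fc55a0f2f5b74e9433d0f47526c76f8f4887c22e
size 497774208
"""


def parse_lfs_pointer(text):
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def file_matches_pointer(path, pointer, chunk_size=1 << 20):
    """Stream the file and compare its size and SHA-256 digest to the pointer."""
    hasher = hashlib.sha256()
    size = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    pointer = parse_lfs_pointer(POINTER_TEXT)
    print(pointer["algo"], pointer["size"])
```

The file is read in chunks because the model is roughly 500 MB; loading it into memory at once would also work but is wasteful.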