End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -15,12 +15,9 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/stojchets/huggingface/runs/i6m5qe3a)
 # sft8
 This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on the generator dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.2183
 ## Model description
@@ -50,13 +47,6 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_steps: 200
 - num_epochs: 3
-### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 0.9118        | 2.56  | 100  | 1.2183          |
 ### Framework versions
 - Transformers 4.43.0.dev0

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
 # sft8
 This model is a fine-tuned version of [deepseek-ai/deepseek-coder-1.3b-base](https://huggingface.co/deepseek-ai/deepseek-coder-1.3b-base) on the generator dataset.
 ## Model description
 - lr_scheduler_warmup_steps: 200
 - num_epochs: 3
 ### Framework versions
 - Transformers 4.43.0.dev0

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ed41fe036f0d5e0645509e89ff42493832332e568a5a68694776ab6cdec76220
 size 4986380064

 version https://git-lfs.github.com/spec/v1
+oid sha256:f5807ff21250ee2b230b0837acffe2574bd54930960d8a50d6064e7ce60d3dc8
 size 4986380064

model-00002-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a6d635569f6cecd009a6d3432c2664440ab244a0511de8e1155bf790f20ae245
 size 399532808

 version https://git-lfs.github.com/spec/v1
+oid sha256:3a3113a47c7526125eb1f53220f2648dba28b4aa29dfa8d3763bb86d5514ea10
 size 399532808

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e4c8b423b83f730d8d147fb9079fc9e2b307dabe286a83f8e3b69b4d72a23480
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:d2d24c9e8ddd9f280e16e434e179f0949fcddd23052521124a754e0b3ccc4acd
 size 5240