End of training

Files changed (5)

README.md CHANGED

@@ -102,7 +102,7 @@ xformers_attention: true
 This model is a fine-tuned version of [unsloth/Qwen2.5-Coder-1.5B-Instruct](https://huggingface.co/unsloth/Qwen2.5-Coder-1.5B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2251
+- Loss: 0.2240
 ## Model description
@@ -140,8 +140,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.0219        | 0.0099 | 1    | 1.3110          |
-| 0.4688        | 0.2472 | 25   | 0.2870          |
-| 0.4603        | 0.4944 | 50   | 0.2251          |
+| 0.4663        | 0.2472 | 25   | 0.2859          |
+| 0.4577        | 0.4944 | 50   | 0.2240          |
 ### Framework versions
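
The README change only touches the reported losses at steps 25 and 50. As a rough sketch, the per-step differences can be computed as below. The numbers are copied from the removed and added table rows; the function name `loss_deltas` is purely illustrative. The diff alone does not say whether differences this small reflect anything beyond run-to-run variation.

```python
# (training loss, validation loss) per step, copied from the diff rows.
old_rows = {25: (0.4688, 0.2870), 50: (0.4603, 0.2251)}
new_rows = {25: (0.4663, 0.2859), 50: (0.4577, 0.2240)}


def loss_deltas(old, new):
    """Return {step: (train_delta, val_delta)} as new minus old, rounded to 4 places."""
    return {
        step: tuple(round(n - o, 4) for o, n in zip(old[step], new[step]))
        for step in old
    }


deltas = loss_deltas(old_rows, new_rows)
for step, (train_d, val_d) in sorted(deltas.items()):
    # A negative delta means the new run reports a lower loss at that step.
    print(f"step {step}: train {train_d:+.4f}, val {val_d:+.4f}")
```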

adapter_config.json CHANGED

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "o_proj",
-    "up_proj",
+    "q_proj",
     "gate_proj",
     "k_proj",
     "v_proj",
-    "down_proj",
-    "q_proj"
+    "o_proj",
+    "up_proj",
+    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,
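
The `target_modules` hunk appears to be a reordering only: the same seven projection names occur before and after. A minimal check, with both lists copied from the diff (the helper `same_modules` is a hypothetical name, not part of any library):

```python
old_target_modules = [
    "o_proj", "up_proj", "gate_proj", "k_proj", "v_proj", "down_proj", "q_proj",
]
new_target_modules = [
    "q_proj", "gate_proj", "k_proj", "v_proj", "o_proj", "up_proj", "down_proj",
]


def same_modules(a, b):
    """True when both lists name the same modules, ignoring order."""
    # The length check guards against duplicates hiding a real difference.
    return len(a) == len(b) and set(a) == set(b)


print(same_modules(old_target_modules, new_target_modules))
```

If the check holds, the adapter targets the same layers in both commits, and the diff in this file is cosmetic. A plausible cause, stated here as an assumption, is that the list is serialized from an unordered collection, so its order can vary between saves.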

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6114a35a3b39bb840d918fd79a854cb4e78556415734a59537c883b9533c2617
+oid sha256:8efe635127ba5c809807794fa5ad152c4c6c5375c721f0cc98b44708c168a38a
 size 147859242

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6628c07e632a154792d1c63a45e296337c556fd1251face0659bd3d5b1367340
+oid sha256:919c8484bf6e987c96c7843faece2bf9df6f3f371220a7b1773a974f49cf4c36
 size 147770496

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:59b265d13211b75297ab419de4a75d8f881d2ff067a757da0360af522259e6af
+oid sha256:2fffa403338c34b764a434c6a622e84733739871feeb07b89e87b6e2503f8f06
 size 6776
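
The three binary files are stored as Git LFS pointers, so the diff shows only a new `oid` with an unchanged `size`. The sketch below parses such a pointer and checks a downloaded payload against it. It assumes the three-line `version` / `oid` / `size` layout seen above; the helper names `parse_lfs_pointer` and `matches_pointer` are illustrative, not part of any Git LFS tooling.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict of its fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Check that raw bytes have the size and SHA-256 digest the pointer records."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


# New pointer for training_args.bin, copied from the diff.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:2fffa403338c34b764a434c6a622e84733739871feeb07b89e87b6e2503f8f06\n"
    "size 6776\n"
)
print(pointer["algo"], pointer["size"])
```

To verify a local copy, read the file in binary mode and pass its bytes to `matches_pointer`. For the adapter weights (about 148 MB each) a chunked hash would use less memory than reading the whole file at once.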