End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1676
 ## Model description
@@ -52,13 +52,13 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.1521        | 0.3704 | 50   | 0.1676          |
 ### Framework versions
-- PEFT 0.13.0
-- Transformers 4.45.1
 - Pytorch 2.4.1+cu121
 - Datasets 3.0.1
-- Tokenizers 0.20.0

 This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1606
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.1688        | 0.3704 | 50   | 0.1606          |
 ### Framework versions
+- PEFT 0.13.2
+- Transformers 4.45.2
 - Pytorch 2.4.1+cu121
 - Datasets 3.0.1
+- Tokenizers 0.20.1

adapter_config.json CHANGED Viewed

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "down_proj",
     "k_proj",
     "v_proj",
     "up_proj",
-    "o_proj",
-    "gate_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "o_proj",
     "k_proj",
+    "gate_proj",
     "v_proj",
+    "q_proj",
     "up_proj",
+    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:79c272e6d0b153955297eb9162efc4c8a05c97784c04e2b7dbec9b5a5d8acfa0
 size 35669232

 version https://git-lfs.github.com/spec/v1
+oid sha256:b08bd427dd3b517563d30ef20f8780a6bbbc694d3367972912dc00863c0e8651
 size 35669232

runs/Oct17_02-03-31_09c52a70180b/events.out.tfevents.1729130616.09c52a70180b.560.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ec6eab9cb638974a8417148c74cadbdc1e66fca4e7816b1456b035fd53b8d52e
+size 10390

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:caabc4b13f97198f29f8ee2853a911dd28c98b111dc0ff66e318c3c44e554646
 size 5560

 version https://git-lfs.github.com/spec/v1
+oid sha256:304cd4f1a5efc0e934bfadb4e5545be57d68d0a473517f5bb74491ecd082399b
 size 5560