End of training

Files changed (5)

README.md CHANGED

@@ -102,7 +102,7 @@ xformers_attention: true
 This model is a fine-tuned version of [unsloth/gemma-2-9b](https://huggingface.co/unsloth/gemma-2-9b) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7667
+- Loss: 0.7679
 ## Model description
@@ -140,8 +140,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 10.8108       | 0.0293 | 1    | 11.6694         |
-| 0.8095        | 0.7326 | 25   | 0.7893          |
-| 0.8004        | 1.4652 | 50   | 0.7667          |
+| 0.8106        | 0.7326 | 25   | 0.7922          |
+| 0.798         | 1.4652 | 50   | 0.7679          |
 ### Framework versions
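
The README change only moves the reported losses slightly. As a rough check, the sketch below computes the per-step difference in validation loss between the two runs. The numbers are copied from the table in the diff; the names `old_eval`, `new_eval`, and `loss_deltas` are illustrative and do not come from the repository.

```python
# Validation loss per logged step, copied from the README diff.
old_eval = {1: 11.6694, 25: 0.7893, 50: 0.7667}
new_eval = {1: 11.6694, 25: 0.7922, 50: 0.7679}


def loss_deltas(old, new):
    """Return (new - old) validation loss for every step logged in both runs."""
    shared_steps = sorted(old.keys() & new.keys())
    return {step: round(new[step] - old[step], 4) for step in shared_steps}


deltas = loss_deltas(old_eval, new_eval)
for step, delta in deltas.items():
    # A positive delta means the newer run has a higher validation loss.
    print(f"step {step:>2}: {delta:+.4f}")
```

Both non-trivial deltas are positive and small, so the newer run ends marginally worse on the evaluation set.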

adapter_config.json CHANGED

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "q_proj",
-    "gate_proj",
-    "o_proj",
-    "down_proj",
     "up_proj",
     "k_proj",
-    "v_proj"
+    "gate_proj",
+    "q_proj",
+    "down_proj",
+    "v_proj",
+    "o_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

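The `target_modules` hunk in adapter_config.json looks like a pure reordering. A minimal sketch to confirm that, comparing the two lists as sets, is shown here. The lists are copied from the diff; `module_diff` is a hypothetical helper. PEFT appears to keep `target_modules` as a set internally, which would explain why the order differs between saves, but that is an assumption this diff does not confirm.

```python
# target_modules before and after the commit, copied from adapter_config.json.
old_modules = ["q_proj", "gate_proj", "o_proj", "down_proj", "up_proj", "k_proj", "v_proj"]
new_modules = ["up_proj", "k_proj", "gate_proj", "q_proj", "down_proj", "v_proj", "o_proj"]


def module_diff(old, new):
    """Return (added, removed) module names, ignoring list order."""
    added = sorted(set(new) - set(old))
    removed = sorted(set(old) - set(new))
    return added, removed


added, removed = module_diff(old_modules, new_modules)
print("added:", added)
print("removed:", removed)
```

If both lists come back empty, the adapter targets the same seven projection layers in both versions and the change is cosmetic.
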
adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f0e5cf6e9f771d372d050bc06d89bf620c04f8ceeb010cc8ec1298a1ea36c993
+oid sha256:6f48a3f92e0231966828a21cab010f46bd6a268fa510004cbdb1a8672cd46100
 size 432357050

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8cdad695471ac5ebe1288a36a991fb68aa3a4df78e52765c1b084596c758dd4d
+oid sha256:1de24fc8f01deb18a95dcc6729ffe30a69cd909f7e3333ffc10f50ea664e1e25
 size 432223744

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:768050f293879227102c7c91bc296e82e7ed39164763de483fc202082d0b9cf5
+oid sha256:071494bbe95931a08631bf47f2dba38f70b866d5bb370d5380baf7279eff516c
 size 6776
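
The three binary files are tracked as Git LFS pointers, so each diff shows only a new `oid` with an unchanged `size`. The sketch below parses two pointer texts and compares them, using the adapter_model.safetensors values from the diff. `parse_lfs_pointer` is an illustrative helper, not part of any Git LFS tooling, and it assumes the simple three-line `key value` layout seen here.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a dict; 'size' is converted to int."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.partition(" ")
        fields[key] = value
    fields["size"] = int(fields["size"])
    return fields


old_ptr = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:8cdad695471ac5ebe1288a36a991fb68aa3a4df78e52765c1b084596c758dd4d\n"
    "size 432223744\n"
)
new_ptr = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:1de24fc8f01deb18a95dcc6729ffe30a69cd909f7e3333ffc10f50ea664e1e25\n"
    "size 432223744\n"
)

same_size = old_ptr["size"] == new_ptr["size"]
same_content = old_ptr["oid"] == new_ptr["oid"]
print(f"same size: {same_size}, same content hash: {same_content}")
```

An identical size with a different hash is what one would expect when the weights were retrained with the same adapter shape: the tensor layout is unchanged, but the stored values differ.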