End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -96,7 +96,7 @@ xformers_attention: true
 This model is a fine-tuned version of [unsloth/gemma-2-9b](https://huggingface.co/unsloth/gemma-2-9b) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0005
 ## Model description
@@ -134,8 +134,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 7.3346        | 0.0001 | 1    | 7.3348          |
-| 0.0           | 0.0037 | 25   | 0.0005          |
-| 0.0           | 0.0073 | 50   | 0.0005          |
 ### Framework versions

 This model is a fine-tuned version of [unsloth/gemma-2-9b](https://huggingface.co/unsloth/gemma-2-9b) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0006
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 7.3346        | 0.0001 | 1    | 7.3348          |
+| 0.0           | 0.0037 | 25   | 0.0006          |
+| 0.0           | 0.0073 | 50   | 0.0006          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "gate_proj",
     "up_proj",
-    "k_proj",
-    "o_proj",
     "q_proj",
-    "v_proj",
-    "down_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
+    "v_proj",
+    "down_proj",
     "gate_proj",
     "up_proj",
     "q_proj",
+    "k_proj",
+    "o_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9d8daa26c907594d808ccd4777f05b54a3687de87406842b939c1d5251b58360
 size 432357050

 version https://git-lfs.github.com/spec/v1
+oid sha256:eeaa03602a31484b0ff44f8e65e7f1d318d8cb5bdb0bfeabf434d4a3a848b06a
 size 432357050

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:25630314da8d1845f9f7a0a4394aa306b6564cd2b8833e9cd8ad779bf9addc6d
 size 432223744

 version https://git-lfs.github.com/spec/v1
+oid sha256:87b613f8f3cbac32004a0f82bf7a98b97c14988e1571669946774462adc90965
 size 432223744

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fdbadb3d73dc147a8615f83812e3216c6fad02d8d1e99a9c0d82882af039b8da
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:4890609b6bb07cc885584bd813aee0e59702d7c9bfaeb0240be3c96e1382ae65
 size 6776