End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1474
 ## Model description
@@ -50,17 +50,17 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.238         | 0.09  | 10   | 1.8587          |
-| 1.7567        | 0.18  | 20   | 1.5411          |
-| 1.2688        | 0.27  | 30   | 0.8329          |
-| 0.5198        | 0.36  | 40   | 0.2499          |
-| 0.1877        | 0.45  | 50   | 0.1580          |
-| 0.1639        | 0.54  | 60   | 0.1526          |
-| 0.1475        | 0.63  | 70   | 0.1475          |
-| 0.1626        | 0.73  | 80   | 0.1470          |
-| 0.1406        | 0.82  | 90   | 0.1481          |
-| 0.1536        | 0.91  | 100  | 0.1477          |
-| 0.1551        | 1.0   | 110  | 0.1474          |
 ### Framework versions

 This model is a fine-tuned version of [google/gemma-2b](https://huggingface.co/google/gemma-2b) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1488
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.2512        | 0.09  | 10   | 1.9189          |
+| 1.9251        | 0.18  | 20   | 1.9020          |
+| 1.8763        | 0.27  | 30   | 1.7975          |
+| 1.7047        | 0.36  | 40   | 1.5334          |
+| 1.3481        | 0.45  | 50   | 1.0845          |
+| 0.903         | 0.54  | 60   | 0.4961          |
+| 0.3472        | 0.63  | 70   | 0.2086          |
+| 0.1915        | 0.73  | 80   | 0.1576          |
+| 0.1476        | 0.82  | 90   | 0.1530          |
+| 0.1554        | 0.91  | 100  | 0.1492          |
+| 0.1582        | 1.0   | 110  | 0.1488          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -9,7 +9,8 @@
   "lora_dropout": 0.05,
   "merge_weights": false,
   "modules_to_save": null,
-  "peft_type": "LORA",
   "r": 4,
   "target_modules": [
     "q_proj",

   "lora_dropout": 0.05,
   "merge_weights": false,
   "modules_to_save": null,
+  "number_of_adapter_pre_layer": 8,
+  "peft_type": "M_LORA",
   "r": 4,
   "target_modules": [
     "q_proj",

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:855559f62eeeb569368897d7c355fb85fb9fbce1a2bf059dd3d5505c2dc2fa3d
-size 3712454

 version https://git-lfs.github.com/spec/v1
+oid sha256:40fda2a319f933566ba1593015ee7e536b6ce2abfd4a60524acd9c30f46ae5d2
+size 29712986

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d5240c481e558ce405f7675528dade66e337817981c7bb8bf4e594113eda16d4
-size 10028407656

 version https://git-lfs.github.com/spec/v1
+oid sha256:2c50a5392796b194b35873fb0eebe36bb720da9115155922bb88b26c9c8412af
+size 10054288712

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e5113a4c05a83fb0d58123bda7d63b0fbf75927924839a3409f242dda76947c0
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:8bdad090fa05589e3b6da89d04118381146e69af58ceef84f3f52bac7339c289
 size 5112