End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -114,7 +114,7 @@ xformers_attention: null
 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v0.6](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.6) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.1028
 ## Model description
@@ -153,10 +153,10 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 7.6743        | 0.0028 | 1    | 7.1293          |
-| 7.7429        | 0.0141 | 5    | 7.0976          |
-| 7.0434        | 0.0283 | 10   | 6.3337          |
-| 5.7719        | 0.0424 | 15   | 5.4988          |
-| 5.695         | 0.0565 | 20   | 5.1028          |
 ### Framework versions

 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-Chat-v0.6](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v0.6) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.1046
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 7.6743        | 0.0028 | 1    | 7.1293          |
+| 7.7426        | 0.0141 | 5    | 7.0973          |
+| 7.0417        | 0.0283 | 10   | 6.3321          |
+| 5.7705        | 0.0424 | 15   | 5.4978          |
+| 5.6954        | 0.0565 | 20   | 5.1046          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,13 +20,13 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "up_proj",
-    "o_proj",
-    "gate_proj",
     "v_proj",
-    "k_proj",
     "down_proj",
-    "q_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "v_proj",
+    "up_proj",
     "down_proj",
+    "gate_proj",
+    "o_proj",
+    "q_proj",
+    "k_proj"
   ],
   "task_type": "CAUSAL_LM",
   "use_dora": false,

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:66a28a9f8f443f7dce46fe12d04b3a6652a55ffeeeba0e4609553b913fa87b77
 size 101036698

 version https://git-lfs.github.com/spec/v1
+oid sha256:75d6d99a911e1898887876a012bb5f063d98f1d0f108c408c15a9a760ccc085a
 size 101036698

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3c0ecc2e9a75d14c753ea2123cc052ce634a5ace8a27427e3887b3bd040ff88f
 size 100966336

 version https://git-lfs.github.com/spec/v1
+oid sha256:6937250c3c124f08bcfec45cd4a325f3405cb3f1f96296f7ad31bd5500713d84
 size 100966336

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:676c5b957839737d77485c6332fc96de69385066f3eef0d74b47f677462f11dc
 size 6712

 version https://git-lfs.github.com/spec/v1
+oid sha256:0b55d7424e01d2da7c827809f0e75e814ca3307809331f5d1880db7934ea6a65
 size 6712