End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -105,7 +105,7 @@ xformers_attention: true
 This model is a fine-tuned version of [tokyotech-llm/Llama-3-Swallow-8B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.4347
 ## Model description
@@ -137,7 +137,7 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.9885        | 0.0032 | 10   | 2.4347          |
 ### Framework versions

 This model is a fine-tuned version of [tokyotech-llm/Llama-3-Swallow-8B-v0.1](https://huggingface.co/tokyotech-llm/Llama-3-Swallow-8B-v0.1) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.4332
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.9874        | 0.0032 | 10   | 2.4332          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -20,12 +20,12 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "down_proj",
-    "up_proj",
-    "gate_proj",
-    "v_proj",
     "q_proj",
     "k_proj",
     "o_proj"
   ],
   "task_type": "CAUSAL_LM",

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "q_proj",
+    "gate_proj",
+    "up_proj",
     "k_proj",
+    "down_proj",
+    "v_proj",
     "o_proj"
   ],
   "task_type": "CAUSAL_LM",

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e55c4039f24bc44008a4f05902cf0a77621fb0cf6dfc3efceafdf89c3e1a4894
 size 167934026

 version https://git-lfs.github.com/spec/v1
+oid sha256:c5ecc4d45bf4c244e13fe6afd2578f5883763c30f3a40f653d33813331ec4481
 size 167934026

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fc775a1761f1f647c702d92e67ca658899ab5ef3cdebdf8519c573f4c4012e0b
 size 167832240

 version https://git-lfs.github.com/spec/v1
+oid sha256:2f4b65f869cbec574d4e40d83bda225537b43aa792b142eb5c462911291411e6
 size 167832240

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b45eef8e612d9b79c5583dcca2bd1a1d2b91284620f30ac5baebaa2439e5b269
 size 6776

 version https://git-lfs.github.com/spec/v1
+oid sha256:2453a0ab8bb2076577a26f096a7aad8f9abd4172336b492664a7eecef74a8938
 size 6776