End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6566
 ## Model description
@@ -47,15 +47,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.1138        | 0.11  | 1000 | 1.7761          |
-| 1.8565        | 0.22  | 2000 | 1.7291          |
-| 1.8133        | 0.33  | 3000 | 1.7039          |
-| 1.8198        | 0.44  | 4000 | 1.6900          |
-| 1.8024        | 0.55  | 5000 | 1.6771          |
-| 1.7781        | 0.66  | 6000 | 1.6691          |
-| 1.7742        | 0.77  | 7000 | 1.6626          |
-| 1.7517        | 0.88  | 8000 | 1.6577          |
-| 1.7566        | 0.99  | 9000 | 1.6566          |
 ### Framework versions

 This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.6981
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.118         | 0.11  | 1000 | 1.7807          |
+| 1.8878        | 0.22  | 2000 | 1.7477          |
+| 1.8609        | 0.33  | 3000 | 1.7318          |
+| 1.8489        | 0.44  | 4000 | 1.7207          |
+| 1.8416        | 0.55  | 5000 | 1.7134          |
+| 1.8181        | 0.66  | 6000 | 1.7082          |
+| 1.8144        | 0.77  | 7000 | 1.7021          |
+| 1.816         | 0.88  | 8000 | 1.6987          |
+| 1.7825        | 0.99  | 9000 | 1.6981          |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -11,7 +11,7 @@
   "lora_dropout": 0.05,
   "modules_to_save": null,
   "peft_type": "LORA",
-  "r": 16,
   "revision": null,
   "target_modules": [
     "c_attn",

   "lora_dropout": 0.05,
   "modules_to_save": null,
   "peft_type": "LORA",
+  "r": 2,
   "revision": null,
   "target_modules": [
     "c_attn",

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d9f352df4e69658cd91e6c60af2576a5dc8b84064ffc61851b0704d78f84f2d8
-size 6513289

 version https://git-lfs.github.com/spec/v1
+oid sha256:d4d0f208eb91a5401bc33aea3bf3acacdd75f0a6f12af7c7b26a1c75f0798b97
+size 836233

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:60a43c9a70949f8b830037486d91e5e77ccaa62aaedc01ce811aff505e38a6d9
 size 4091

 version https://git-lfs.github.com/spec/v1
+oid sha256:aa37a1b2633f76e2d008cd2e07f89b0c7672d79bf41e03d5e7d9f8cb3b7a9e0d
 size 4091