sayanbanerjee32 commited on
Commit
98f1585
1 Parent(s): 49cf445

End of training

Browse files
README.md CHANGED
@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
18
 
19
  This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.1606
22
 
23
  ## Model description
24
 
@@ -52,7 +52,8 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:----:|:---------------:|
55
- | 0.1688 | 0.3704 | 50 | 0.1606 |
 
56
 
57
 
58
  ### Framework versions
 
18
 
19
  This model is a fine-tuned version of [microsoft/Phi-3.5-mini-instruct](https://huggingface.co/microsoft/Phi-3.5-mini-instruct) on the None dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.1667
22
 
23
  ## Model description
24
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:----:|:---------------:|
55
+ | 0.3316 | 0.1852 | 25 | 0.2099 |
56
+ | 0.1567 | 0.3704 | 50 | 0.1667 |
57
 
58
 
59
  ### Framework versions
adapter_config.json CHANGED
@@ -20,13 +20,13 @@
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
23
- "o_proj",
 
24
  "k_proj",
 
25
  "gate_proj",
26
- "v_proj",
27
- "q_proj",
28
- "up_proj",
29
- "down_proj"
30
  ],
31
  "task_type": "CAUSAL_LM",
32
  "use_dora": false,
 
20
  "rank_pattern": {},
21
  "revision": null,
22
  "target_modules": [
23
+ "q_proj",
24
+ "v_proj",
25
  "k_proj",
26
+ "down_proj",
27
  "gate_proj",
28
+ "o_proj",
29
+ "up_proj"
 
 
30
  ],
31
  "task_type": "CAUSAL_LM",
32
  "use_dora": false,
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:b08bd427dd3b517563d30ef20f8780a6bbbc694d3367972912dc00863c0e8651
3
  size 35669232
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5bc5f7844254d65aeed2b9147fc8bea07b5972c83b18f679ad5424ffd972981a
3
  size 35669232
runs/Oct18_10-18-36_ddde5c08535d/events.out.tfevents.1729246720.ddde5c08535d.581.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ba0531c40fa9ca6f16ab65675a9a70939a4242b9c14c2fdcf4b536ef4c5c7e51
3
+ size 8739
runs/Oct18_10-25-49_ddde5c08535d/events.out.tfevents.1729247151.ddde5c08535d.581.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8ab3f0705bd80bd985b8d161620c019d0781ce2729b753fd0d262d104eba3b01
3
+ size 10654
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:304cd4f1a5efc0e934bfadb4e5545be57d68d0a473517f5bb74491ecd082399b
3
  size 5560
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9bec72c5bff4340c7a8eb9edc63f3762ed5f8966db95753b023434944d73a950
3
  size 5560