smangrul commited on
Commit
2f16536
1 Parent(s): 52eed36

Model save

Browse files
README.md CHANGED
@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
19
 
20
  This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T) on the wikisql dataset.
21
  It achieves the following results on the evaluation set:
22
- - Loss: 0.0562
23
 
24
  ## Model description
25
 
@@ -54,7 +54,7 @@ The following hyperparameters were used during training:
54
 
55
  | Training Loss | Epoch | Step | Validation Loss |
56
  |:-------------:|:-----:|:----:|:---------------:|
57
- | 0.0664 | 1.0 | 263 | 0.0562 |
58
 
59
 
60
  ### Framework versions
 
19
 
20
  This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T) on the wikisql dataset.
21
  It achieves the following results on the evaluation set:
22
+ - Loss: 0.0457
23
 
24
  ## Model description
25
 
 
54
 
55
  | Training Loss | Epoch | Step | Validation Loss |
56
  |:-------------:|:-----:|:----:|:---------------:|
57
+ | 0.0482 | 1.0 | 263 | 0.0457 |
58
 
59
 
60
  ### Framework versions
adapter_config.json CHANGED
@@ -19,14 +19,12 @@
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
- "gate_proj",
23
- "up_proj",
24
- "embed_tokens",
25
  "down_proj",
26
- "o_proj",
27
  "v_proj",
28
- "q_proj",
29
- "lm_head",
 
30
  "k_proj"
31
  ],
32
  "task_type": "CAUSAL_LM",
 
19
  "rank_pattern": {},
20
  "revision": null,
21
  "target_modules": [
22
+ "q_proj",
 
 
23
  "down_proj",
 
24
  "v_proj",
25
+ "up_proj",
26
+ "o_proj",
27
+ "gate_proj",
28
  "k_proj"
29
  ],
30
  "task_type": "CAUSAL_LM",
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:235d1bf7bb0a017a1da20cef0977c5d6507e56bfcb0c08336f1d44faf561be0b
3
- size 551740408
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:163b8c71767bb6675fae40f92c4bab0d491181d5560e921665fa31f276452bf4
3
+ size 25271744
runs/Jan17_06-54-16_hf-dgx-01/events.out.tfevents.1705470859.hf-dgx-01.2323147.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fda4e71e29986c23024a8322410bb20aecac6c7a6b25d9f9917527fb4398a34d
3
+ size 13170
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:54665397da31f83aabde86527007bad96daaf3abdbbc8eee2c378f30ec28f5c8
3
  size 4728
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dd7494a366eab6dfdab91658ee32de530ff5e3a039463d7000aebcd686a9241d
3
  size 4728