jkazdan committed on
Commit 14b0608 · verified · 1 Parent(s): ad903e8

End of training

README.md CHANGED
@@ -1,6 +1,6 @@
 ---
 license: gemma
-base_model: jkazdan/step_val_25_gemma-2-2b_hs2_iter1_sftsd2
+base_model: google/gemma-2-2b
 tags:
 - trl
 - sft
@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 # 16R-16F-gemma-2-2b_hs2_iter1_sftsd1
 
-This model is a fine-tuned version of [jkazdan/step_val_25_gemma-2-2b_hs2_iter1_sftsd2](https://huggingface.co/jkazdan/step_val_25_gemma-2-2b_hs2_iter1_sftsd2) on an unknown dataset.
+This model is a fine-tuned version of [google/gemma-2-2b](https://huggingface.co/google/gemma-2-2b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0950
+- Loss: 1.3956
 - Num Input Tokens Seen: 13600
 
 ## Model description
@@ -52,7 +52,7 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Validation Loss | Input Tokens Seen |
 |:-------------:|:-----:|:----:|:---------------:|:-----------------:|
-| No log | 0 | 0 | 1.0950 | 0 |
+| No log | 0 | 0 | 1.3956 | 0 |
 
 
 ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "jkazdan/step_val_25_gemma-2-2b_hs2_iter1_sftsd2",
+  "_name_or_path": "google/gemma-2-2b",
   "architectures": [
     "Gemma2ForCausalLM"
   ],
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d1905178e1bceafb87d32935a11b7a94d62db1b2601b958bdeca016df3c19cc1
+oid sha256:b9fbdf37e086eb1a4fab58eb8a45973889b5525b6d60610da136fcf89f8effea
 size 4988025760
model-00002-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7ef2c990e63ce448db38aced7fed542776bcd346bade38790a8ad02f640c9807
+oid sha256:c627b42b163313dd192ce823e4b97a9fbdb4d3bae5d46d5db3e161fc9e69679a
 size 240691728
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:67600657e88841e3649e602633cb2054438a2a323b029c03284792a6e833440a
+oid sha256:ce4b4b6993adf72d91f739ae75df3062c63896703ac48c5f2008ed66ae3d597b
 size 5560
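
The three binary files in this commit are stored as Git LFS pointers, so the diff shows only a changed `oid` next to an unchanged `size`. The sketch below is one possible way to read such a pointer and compare two versions. The helper name `parse_lfs_pointer` is hypothetical (it is not part of Git LFS or any Hugging Face library), and it assumes the three-line `version` / `oid` / `size` layout seen above. The sample values are the `training_args.bin` pointers from this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file (hypothetical helper, stdlib only)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", separated by the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointers for training_args.bin, copied from the diff above.
old = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:67600657e88841e3649e602633cb2054438a2a323b029c03284792a6e833440a\n"
    "size 5560\n"
)
new = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:ce4b4b6993adf72d91f739ae75df3062c63896703ac48c5f2008ed66ae3d597b\n"
    "size 5560\n"
)

content_changed = old["digest"] != new["digest"]  # different hash, so different bytes
same_size = old["size"] == new["size"]            # byte count is unchanged
```

A differing digest with an identical size is consistent with a file whose contents were rewritten without changing its byte length, which is what all three pointer diffs in this commit show.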