End of training

Files changed (5)

README.md CHANGED

@@ -1,6 +1,6 @@
 ---
 license: gemma
-base_model: jkazdan/step_val_25_gemma-2-2b_hs2_iter1_sftsd2
 tags:
 - trl
 - sft
@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 # 16R-16F-gemma-2-2b_hs2_iter1_sftsd1
-This model is a fine-tuned version of [jkazdan/step_val_25_gemma-2-2b_hs2_iter1_sftsd2](https://huggingface.co/jkazdan/step_val_25_gemma-2-2b_hs2_iter1_sftsd2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0950
 - Num Input Tokens Seen: 13600
 ## Model description
@@ -52,7 +52,7 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Input Tokens Seen |
 |:-------------:|:-----:|:----:|:---------------:|:-----------------:|
-| No log        | 0     | 0    | 1.0950          | 0                 |
 ### Framework versions

 ---
 license: gemma
+base_model: google/gemma-2-2b
 tags:
 - trl
 - sft
 # 16R-16F-gemma-2-2b_hs2_iter1_sftsd1
+This model is a fine-tuned version of [google/gemma-2-2b](https://huggingface.co/google/gemma-2-2b) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.3956
 - Num Input Tokens Seen: 13600
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss | Input Tokens Seen |
 |:-------------:|:-----:|:----:|:---------------:|:-----------------:|
+| No log        | 0     | 0    | 1.3956          | 0                 |
 ### Framework versions

config.json CHANGED

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "jkazdan/step_val_25_gemma-2-2b_hs2_iter1_sftsd2",
   "architectures": [
     "Gemma2ForCausalLM"
   ],

 {
+  "_name_or_path": "google/gemma-2-2b",
   "architectures": [
     "Gemma2ForCausalLM"
   ],

model-00001-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d1905178e1bceafb87d32935a11b7a94d62db1b2601b958bdeca016df3c19cc1
 size 4988025760

 version https://git-lfs.github.com/spec/v1
+oid sha256:b9fbdf37e086eb1a4fab58eb8a45973889b5525b6d60610da136fcf89f8effea
 size 4988025760

model-00002-of-00002.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7ef2c990e63ce448db38aced7fed542776bcd346bade38790a8ad02f640c9807
 size 240691728

 version https://git-lfs.github.com/spec/v1
+oid sha256:c627b42b163313dd192ce823e4b97a9fbdb4d3bae5d46d5db3e161fc9e69679a
 size 240691728

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:67600657e88841e3649e602633cb2054438a2a323b029c03284792a6e833440a
 size 5560

 version https://git-lfs.github.com/spec/v1
+oid sha256:ce4b4b6993adf72d91f739ae75df3062c63896703ac48c5f2008ed66ae3d597b
 size 5560
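
The three binary files above are stored as Git LFS pointers, so the diff only shows a new `oid` with an unchanged `size`. A minimal sketch of how such a pointer pair could be compared is given below, using the `training_args.bin` pointers from this commit. The helper name `parse_lfs_pointer` is hypothetical and not part of any library; it assumes the simple `key value` line layout visible in the pointers.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into a dict of its key/value fields.

    Hypothetical helper: assumes one "key value" pair per line,
    as in the pointers shown in this diff.
    """
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    return fields


# Old and new pointers for training_args.bin, copied from the diff.
old = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:67600657e88841e3649e602633cb2054438a2a323b029c03284792a6e833440a\n"
    "size 5560\n"
)
new = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:ce4b4b6993adf72d91f739ae75df3062c63896703ac48c5f2008ed66ae3d597b\n"
    "size 5560\n"
)

# A different oid means the file content changed, even though
# the byte size is identical.
content_changed = old["oid"] != new["oid"]
size_unchanged = int(old["size"]) == int(new["size"])
print(content_changed, size_unchanged)
```

An identical size with a different hash is what one would expect when the same architecture is re-saved with different weights or arguments, though the pointers alone do not say what changed inside the files.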