sedrickkeh commited on
Commit
22a5fa7
·
verified ·
1 Parent(s): e73b5f9

Model save

Browse files
README.md CHANGED
@@ -4,7 +4,6 @@ license: llama3.1
4
  base_model: meta-llama/Meta-Llama-3.1-8B
5
  tags:
6
  - llama-factory
7
- - full
8
  - generated_from_trainer
9
  model-index:
10
  - name: oh-dcft-v1.2_no-curation_gpt-4o-mini
@@ -16,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  # oh-dcft-v1.2_no-curation_gpt-4o-mini
18
 
19
- This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) on the mlfoundations-dev/oh-dcft-v1.2_no-curation_gpt-4o-mini dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.6520
22
 
23
  ## Model description
24
 
@@ -55,9 +54,9 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss |
57
  |:-------------:|:-----:|:----:|:---------------:|
58
- | 0.6591 | 1.0 | 329 | 0.6591 |
59
- | 0.6101 | 2.0 | 658 | 0.6488 |
60
- | 0.5704 | 3.0 | 987 | 0.6520 |
61
 
62
 
63
  ### Framework versions
 
4
  base_model: meta-llama/Meta-Llama-3.1-8B
5
  tags:
6
  - llama-factory
 
7
  - generated_from_trainer
8
  model-index:
9
  - name: oh-dcft-v1.2_no-curation_gpt-4o-mini
 
15
 
16
  # oh-dcft-v1.2_no-curation_gpt-4o-mini
17
 
18
+ This model is a fine-tuned version of [meta-llama/Meta-Llama-3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 0.6519
21
 
22
  ## Model description
23
 
 
54
 
55
  | Training Loss | Epoch | Step | Validation Loss |
56
  |:-------------:|:-----:|:----:|:---------------:|
57
+ | 0.6591 | 1.0 | 329 | 0.6592 |
58
+ | 0.6102 | 2.0 | 658 | 0.6488 |
59
+ | 0.5707 | 3.0 | 987 | 0.6519 |
60
 
61
 
62
  ### Framework versions
model-00001-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:fd1ab7668b777c90ac006b3d2a37777959282e39caf68931f9c4188d9913b578
3
  size 4976698672
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:06d826e0e395da40dd74c71128a3b1ae1a1a1b492eb86b735e261ab329929e05
3
  size 4976698672
model-00002-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:32b27f5ca180008f120726e147001f43b65765619359d71c777dbf589d16fd76
3
  size 4999802720
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a5f58bb5b595aac54eda080b8a3a38370bde145cb3faa6dc5a90a64c630750c4
3
  size 4999802720
model-00003-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:376569465f563abc5c7f0a557a004f97ebe9de9e446506df9b44bd632047b18a
3
  size 4915916176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e7e17bff910da9bbf13c055067eabd419ccddbc69318b2353b5340c5c57507ef
3
  size 4915916176
model-00004-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4edc33fe7782949d973dceda437ddb3debc2896b0e9b07a44bccdd7895f9d6aa
3
  size 1168138808
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:669f17e4cdfc06dd6843ac2a2d7b2f767292df4ea760eb03c537005da8484cee
3
  size 1168138808
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:54ac6424366eaeb9bb46e6e2b3f7103ee5b3693e18cb17196174ddad865b603f
3
  size 7224
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9e3c06fefd5e9133f5ebfb5b183836faad86fa6fb924fc415139d758b25fe295
3
  size 7224