End of training
- README.md +2 -2
- adapter_model.bin +1 -1
README.md
CHANGED
@@ -104,7 +104,7 @@ xformers_attention: null
 
 This model is a fine-tuned version of [fxmarty/tiny-llama-fast-tokenizer](https://huggingface.co/fxmarty/tiny-llama-fast-tokenizer) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 10.
+- Loss: 10.3853
 
 ## Model description
 
@@ -142,7 +142,7 @@ The following hyperparameters were used during training:
 | 10.3888 | 0.0001 | 1 | 10.3863 |
 | 10.3902 | 0.0002 | 3 | 10.3862 |
 | 10.3935 | 0.0004 | 6 | 10.3859 |
-| 10.3808 | 0.0006 | 9 | 10.
+| 10.3808 | 0.0006 | 9 | 10.3853 |
 
 
 ### Framework versions
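To put the evaluation losses in the table above in context, they can be converted to perplexity and compared against a uniform-guess baseline. This is a rough sketch, not part of the commit: it assumes the reported loss is a mean per-token cross-entropy in nats, and the vocabulary size of 32000 is a hypothetical value that this commit does not state.

```python
import math

# Evaluation losses copied from the results table, keyed by training step.
eval_loss_by_step = {1: 10.3863, 3: 10.3862, 6: 10.3859, 9: 10.3853}

# ASSUMPTION: the vocabulary size is not given in this commit; 32000 is a
# hypothetical value used only to illustrate the baseline.
ASSUMED_VOCAB_SIZE = 32000


def perplexity(loss_nats: float) -> float:
    """Perplexity of a mean cross-entropy loss measured in nats."""
    return math.exp(loss_nats)


# Loss of a model that assigns equal probability to every token.
uniform_loss = math.log(ASSUMED_VOCAB_SIZE)

for step, loss in eval_loss_by_step.items():
    print(
        f"step {step}: loss={loss:.4f} "
        f"perplexity={perplexity(loss):.0f} "
        f"gap_to_uniform={loss - uniform_loss:+.4f}"
    )
```

Under these assumptions, every evaluation loss stays slightly above the uniform baseline, and the change between step 1 and step 9 is about 0.001 nats.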
adapter_model.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:64975a076984ab9a23d2750f0aa36bc8e439616eca743d20cfe66a16f1a486e5
 size 33666
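The `adapter_model.bin` entry is a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 of the real file, and its size in bytes. A downloaded copy can be checked against the pointer roughly as follows. This is a minimal sketch; the helper names `parse_lfs_pointer`, `bytes_match_pointer`, and `file_matches_pointer` are hypothetical and not part of any Git LFS tooling.

```python
import hashlib
from pathlib import Path

# New pointer contents as shown in this commit.
POINTER_TEXT = """version https://git-lfs.github.com/spec/v1
oid sha256:64975a076984ab9a23d2750f0aa36bc8e439616eca743d20cfe66a16f1a486e5
size 33666
"""


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def bytes_match_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if data has the size and digest recorded in the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


def file_matches_pointer(path: str, pointer: dict) -> bool:
    """Convenience wrapper that reads the file from disk first."""
    return bytes_match_pointer(Path(path).read_bytes(), pointer)


pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Calling `file_matches_pointer("adapter_model.bin", pointer)` on a local download (the path here is illustrative) would then report whether the file is the one this commit refers to.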