Update README.md
README.md CHANGED
@@ -5,7 +5,7 @@ license: apache-2.0
 # LimaRP-Mistral-7B-v0.1 (Alpaca)
 
 This is a version of LimaRP for [Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) with
-about 2000 training samples _up to_ 9k tokens length
+about 2000 training samples _up to_ 9k tokens length.
 
 For more details about LimaRP, see the model page for the [previously released v2 version for Llama-2](https://huggingface.co/lemonilia/limarp-llama2-v2).
 Most details written there apply for this version as well. Generally speaking, LimaRP is a longform-oriented, novel-style
@@ -100,10 +100,13 @@ on 4x NVidia A40 GPUs.
 The A40 GPUs have been graciously provided by [Arc Compute](https://www.arccompute.io/).
 
 ### Training hyperparameters
+Although 1 training epoch was used, the underlying data comprised the same samples repeated twice
+in slightly different formats.
+
 - learning_rate: 0.0003
 - lr_scheduler: constant_with_warmup
 - noisy_embedding_alpha: 5
-- num_epochs:
+- num_epochs: 1
 - sequence_len: 8750
 - lora_r: 256
 - lora_alpha: 16
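As a rough illustration of how the hyperparameters listed in the hunk above map onto common fine-tuning code, here is a minimal sketch using Hugging Face `peft` and `transformers`. The actual training stack is not shown in this commit, so the class choices, the `output_dir`, and `neftune_noise_alpha` (taken as an assumed analogue of `noisy_embedding_alpha`) are illustrative assumptions, not the author's configuration.

```python
# Minimal sketch (assumptions, not the author's training code): the listed
# hyperparameters expressed with Hugging Face peft/transformers objects.
from peft import LoraConfig
from transformers import TrainingArguments

# LoRA adapter settings from the hyperparameter list.
lora_config = LoraConfig(
    r=256,            # lora_r: 256
    lora_alpha=16,    # lora_alpha: 16
    task_type="CAUSAL_LM",
)

# Optimizer/schedule settings from the hyperparameter list.
training_args = TrainingArguments(
    output_dir="limarp-mistral-7b-lora",       # hypothetical output path
    learning_rate=3e-4,                        # learning_rate: 0.0003
    lr_scheduler_type="constant_with_warmup",  # lr_scheduler
    num_train_epochs=1,                        # num_epochs: 1
    neftune_noise_alpha=5,                     # assumed analogue of noisy_embedding_alpha: 5
)
# sequence_len: 8750 would be enforced at tokenization/packing time, not here.
```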