redrix commited on
Commit
5cdf41b
1 Parent(s): 1ab741c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -18,6 +18,7 @@ license: apache-2.0
18
  > He stood in the way, for he didn't understand. Unfortunate - there was potential.
19
 
20
  This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
21
  This is my fifth model. The original model was created as a simple test of *Model Stock*, thus the name (nepoticide=nephew, not as important nor direct). A broken tokenizer caused me to remerge the model. I also chose to use [TheDrummer/UnslopNemo-12B-v4](https://huggingface.co/TheDrummer/UnslopNemo-12B-v4), as TheDrummer stated that this model has more anti-gptism influence while taking a hit to intelligence, which should get balanced by the other models.
22
 
23
  ## Testing stage: early testing
@@ -27,7 +28,10 @@ I do not know how this model holds up over long term context. Early testing show
27
  - **Context size:** Not more than *20k* recommended - coherency may degrade.
28
  - **Chat Template:** *ChatML*; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested
29
  - **Samplers:** A *Temperature-Last* of 1 and *Min-P* of 0.1 are viable, but haven't been finetuned. Activate *DRY* if repetition appears. *XTC* is untested.
 
30
  ## Quantization
 
 
31
  ## Parameters
32
  - **Context size:** Not more than *20k* recommended - coherency may degrade.
33
  - **Chat Template:** *ChatML*; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested
 
18
  > He stood in the way, for he didn't understand. Unfortunate - there was potential.
19
 
20
  This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
21
+
22
  This is my fifth model. The original model was created as a simple test of *Model Stock*, thus the name (nepoticide=nephew, not as important nor direct). A broken tokenizer caused me to remerge the model. I also chose to use [TheDrummer/UnslopNemo-12B-v4](https://huggingface.co/TheDrummer/UnslopNemo-12B-v4), as TheDrummer stated that this model has more anti-gptism influence while taking a hit to intelligence, which should get balanced by the other models.
23
 
24
  ## Testing stage: early testing
 
28
  - **Context size:** Not more than *20k* recommended - coherency may degrade.
29
  - **Chat Template:** *ChatML*; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested
30
  - **Samplers:** A *Temperature-Last* of 1 and *Min-P* of 0.1 are viable, but haven't been finetuned. Activate *DRY* if repetition appears. *XTC* is untested.
31
+
32
  ## Quantization
33
+ Static **GGUF** Quants available at [redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax-v2-GGUF](https://huggingface.co/redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax-v2-GGUF)
34
+
35
  ## Parameters
36
  - **Context size:** Not more than *20k* recommended - coherency may degrade.
37
  - **Chat Template:** *ChatML*; Metharme/Pygmalion (as per UnslopNemo) may work, but effects are untested