brucethemoose committed on
Commit • a90048d
1 Parent(s): ba8e2e7
Update README.md
README.md CHANGED
@@ -7,7 +7,7 @@ language:
 7 | library_name: transformers
 8 | pipeline_tag: text-generation
 9 | ---
10 | -
10 | + Nous-Capybara-34B and Tess-M-Creative-v1.0 merged, then quantized with exllamav2 on 200 rows (400K tokens) of a long Vicuna-format chat, a sci-fi story, and a fantasy story. This should hopefully yield better chat performance than the default wikitext quantization.
11 |
12 | Quantized to 4bpw, enough for **~47K context on a 24GB GPU.**
13 |
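
For context on the claims in the diff above, here is a minimal sketch of loading a 4bpw EXL2 quant like this one with the exllamav2 Python API and a roughly 47K-token cache on a single 24GB GPU. The local model directory, exact context length, and sampling settings are illustrative assumptions, not values specified in this commit.

```python
# Minimal sketch: load a 4bpw EXL2 quant with exllamav2 and a ~47K-token cache.
# The model directory below is a hypothetical local download path.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "./CapyTess-34B-200K-exl2-4bpw"  # hypothetical local path
config.prepare()
config.max_seq_len = 47104                          # ~47K context, assumed to fit on 24GB

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)            # cache tensors allocated during load
model.load_autosplit(cache)
tokenizer = ExLlamaV2Tokenizer(config)

generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)
settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.7

# Vicuna-style prompt, matching the calibration data format described above.
prompt = "USER: Summarize the plot of a space-opera novel.\nASSISTANT:"
print(generator.generate_simple(prompt, settings, 256))
```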