NOTE: Links to GGUFs below.

MOE SPECIFIC NOTE:
If you want to change the "default" number of experts, modify "config.json":

"num_experts_per_tok": 2,
The user will still be able to modify it, if the LLM/AI app has a setting option for this.

Each time you add or subtract an expert, the tokens-per-second speed will change.
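The config.json edit described above can be sketched in Python. This is a minimal, illustrative example: the file path and the surrounding keys are stand-ins, and only the "num_experts_per_tok" key comes from this README.

```python
import json
import os
import tempfile

# Stand-in location for the model's config.json (illustrative only).
config_path = os.path.join(tempfile.mkdtemp(), "config.json")

# Write a placeholder config so the example is self-contained;
# a real config.json has many more keys.
with open(config_path, "w") as f:
    json.dump({"num_experts_per_tok": 2}, f)

# Load the config, change the default number of active experts, and save.
with open(config_path) as f:
    config = json.load(f)

config["num_experts_per_tok"] = 4  # more experts = slower tokens/sec

with open(config_path, "w") as f:
    json.dump(config, f, indent=2)
```

After saving, the model will activate the new number of experts by default; apps that expose an experts setting can still override it at runtime.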
<B>IMPORTANT: Highest Quality Settings / Optimal Operation Guide / Parameters and Samplers</B>

If you are going to use this model (source, GGUF, or a different quant), please review this document for critical parameter, sampler, and advanced sampler settings (for multiple AI/LLM apps).