Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ tags:
 
 This is a quick "down and dirty" demo, with full sampler settings (3) to augment operation of "Llama-3.3-70B-Instruct" at "IQ1_S" (ultra low bit).
 
-(can also apply these using IQ1_M, IQ2 quants too AND use for any 70B model at low quant levels.)
+(can also apply these using IQ1_M, IQ2 quants too AND you can use these settings for any 70B model at low quant levels.)
 
 This will allow you to load and run this model on a 16 GB video card fully, at 2048 ctx and achieve 13-15 t/s.
 