DavidAU committed on
Commit 7eb48f3 · verified · 1 Parent(s): 342da20

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ tags:

  This is a quick "down and dirty" demo, with full sampler settings (3) to augment operation of "Llama-3.3-70B-Instruct" at "IQ1_S" (ultra low bit).

- (can also apply these using IQ1_M, IQ2 quants too AND use for any 70B model at low quant levels.)
+ (can also apply these using IQ1_M, IQ2 quants too AND you can use these settings for any 70B model at low quant levels.)

  This will allow you to load and run this model on a 16 GB video card fully, at 2048 ctx and achieve 13-15 t/s.