Text Generation
GGUF
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prosing
vivid writing
fiction
roleplaying
bfloat16
role play
128k context
llama3.2
Inference Endpoints
imatrix
conversational
Could you quant a Q4_0_4_8 ?
#1
by
DocStrangeLoop
- opened
Hi there, I love this model, it performs absurdly well on mobile at Q3KM, could you also provide arm quantizations like Q4_0_4_8?
@DocStrangeLoop These will be up in about an hour or so.
@DavidAU Looks like they uploaded for the 1B but not the 3B. Ty for the quick response.
See what I can do ; don't have an ETA at the moment.