Kusawa kusawa's picture

5 5

Kusawa kusawa

sunnykusawa

·

AI & ML interests

LLM, Quantization, FineTunning

Organizations

None yet

sunnykusawa's activity

New activity in MaziyarPanahi/Meta-Llama-3.1-8B-Instruct-GGUF 6 months ago

unable to load quantized 4bit_m

#5 opened 7 months ago by

New activity in meta-llama/Llama-3.1-8B-Instruct 7 months ago

unable to load 4-bit quantized varient with llama.cpp

#31 opened 7 months ago by

New activity in mistralai/Mixtral-8x7B-Instruct-v0.1 10 months ago

Input token size issue, does it realy supports 32k tokens?

#197 opened 10 months ago by

Input validation error: `inputs` tokens + `max_new_tokens` must be <= 2048. on Mixtral8x7b 32K token

#199 opened 10 months ago by

New activity in TheBloke/CodeLlama-13B-Instruct-GGUF 10 months ago

Deploy Quantized model on AWS Sagemaker

#4 opened 10 months ago by