Michael Goin

mgoin

AI & ML interests

LLM inference optimization, compression, quantization, pruning, distillation

Recent Activity

updated a model about 1 hour ago
nm-testing/pixtral-12b-FP8-dynamic-all
updated a model 3 days ago
mistralai/Pixtral-Large-Instruct-2411

mgoin's activity

New activity in neuralmagic/pixtral-12b-FP8-dynamic 20 days ago

Update model card

#1 opened 20 days ago by nm-research

Oom with 24g vram

3
#1 opened about 2 months ago by Klopez
New activity in neuralmagic/Phi-3.5-mini-instruct-FP8-KV about 2 months ago
New activity in meta-llama/Llama-3.1-405B-Instruct 4 months ago

8-kv-heads

4
#17 opened 4 months ago by ArthurZ
New activity in meta-llama/Llama-3.1-405B 4 months ago

8-kv-heads

3
#21 opened 4 months ago by ArthurZ

run with vllm

8
#4 opened 4 months ago by kuliev-vitaly