Michael Goin
mgoin
AI & ML interests
LLM inference optimization, compression, quantization, pruning, distillation
Recent Activity
updated
a model
1 day ago
mgoin/GLM-4.6-FP8-BLOCK
published
a model
1 day ago
mgoin/GLM-4.6-FP8-BLOCK
updated
a model
28 days ago
inference-optimization/NVIDIA-Nemotron-3-Nano-30B-A3B-NVFP4