arxiv:2411.02355
Alexandre Marques
alexmarques
AI & ML interests
None yet
Recent Activity
new activity
about 2 months ago
neuralmagic/Meta-Llama-3.1-8B-Instruct-quantized.w4a16:KV Cache Quantization - what is the default precision
updated
a model
about 2 months ago
neuralmagic/Qwen2.5-0.5B-Instruct-quantized.w8a8
updated
a model
about 2 months ago
nm-testing/llama-3-fp8-2of4-dynamic-uncompressed
Organizations
Papers
2
datasets
None public yet