Zheng Han
traphix
AI & ML interests
None yet
Recent Activity
new activity about 13 hours ago in RedHatAI/Qwen3.5-122B-A10B-FP8-dynamic: "Creation details?"
new activity about 14 hours ago in nivvis/Qwen3.5-122B-A10B-heretic-v2-FP8: "Which framework was used for FP8 quantization? LLM-compressor?"
new activity 1 day ago in huihui-ai/Huihui-Qwen3-Coder-Next-abliterated: "GPTQ quantization"

Organizations
None yet
Creation details?
#2 opened about 13 hours ago by traphix

Which framework was used for FP8 quantization? LLM-compressor?
2 · #1 opened 1 day ago by traphix

GPTQ quantization
2 · #2 opened about 1 month ago by ArtemSultanov
Question about weight_observer?
2 · #1 opened 11 days ago by traphix

INT4 w4a16 quantization?
➕ 1 · #1 opened 21 days ago by traphix

Quantization code for int4 (w4a16)?
#6 opened 22 days ago by traphix

W4A16 quant
👍 2 · 4 · #1 opened about 1 month ago by timroethig
Tokenizer you are loading with an incorrect regex pattern
1 · #2 opened 3 months ago by traphix

Failed to find a kernel that can implement the WNA16 linear layer
#1 opened 3 months ago by traphix

vllm error: Extra inputs are not permitted
#1 opened 3 months ago by traphix

Can A100 run Qwen3-235B-A22B-Instruct-2507-NVFP4?
#1 opened 4 months ago by traphix
Error on 4 x L40s
➕ 2 · 1 · #4 opened 6 months ago by traphix

I got ValueError
👀 2 · 10 · #3 opened 6 months ago by spow12

How to run this model via vllm?
11 · #2 opened 6 months ago by traphix
FP8 please
👀 ➕ 16 · 8 · #18 opened 6 months ago by aliquis-pe

vllm v0.10.2 error
❤️ 1 · #2 opened 6 months ago by traphix

VLLM compatibility?
9 · #1 opened 6 months ago by aidendle94

Instruct or Thinking?
#1 opened 6 months ago by traphix