254.6
TFLOPS
21
GadflyII
27 followers · 1 following
GadflyII's activity
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
11 days ago
SGLang and MTP
1
#2 opened 23 days ago by
Michalea
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
24 days ago
Model requests?
12
#4 opened about 1 month ago by
pathosethoslogos
Why Your NVFP4 Model Is Slower Than FP8 on the GB10 (NVIDIA Spark) — And How to Fix It
👍
2
2
#5 opened about 1 month ago by
scottgl
New activity in
GadflyII/GLM-4.6V-NVFP4
24 days ago
Fails on a single DGX spark with errors below
1
#2 opened 29 days ago by
Adrian1234
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
about 1 month ago
Update MXFP4 format to compressed-tensors
1
#3 opened about 1 month ago by
mgoin
New activity in
lukealonso/MiniMax-M2.5-NVFP4
about 1 month ago
Here's the vLLM recipe I'm using with 2x RTX Pro 6000
👍
3
17
#1 opened about 1 month ago by
zenmagnets
New activity in
GadflyII/Qwen3-Coder-Next-NVFP4
about 2 months ago
MMLU PRO Benchmark
3
#3 opened about 2 months ago by
sevapru
vLLM 0.16?
1
#2 opened about 2 months ago by
MMaxHugg
Memory
1
#1 opened about 2 months ago by
struxx
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
about 2 months ago
confused response
7
#8 opened about 2 months ago by
jiangyizhi
MTP quality, 47 layer
3
#7 opened about 2 months ago by
Michalea
New activity in
GadflyII/GLM-4.7-Flash-MTP-NVFP4
about 2 months ago
Upload folder using huggingface_hub
#1 opened about 2 months ago by
GadflyII
New activity in
GadflyII/GLM-4.6V-NVFP4
about 2 months ago
Well done nvfp4 quant
1
#1 opened about 2 months ago by
josephbreda
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
2 months ago
Can't deploy by vllm 0.14.1 + transformers
8
#6 opened 2 months ago by
Butterfly-314
New activity in
GadflyII/GLM-4.7-Flash-MXFP4
2 months ago
can not run
4
#1 opened 2 months ago by
aliez-ren
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
2 months ago
please create mlx version of this
3
#4 opened 2 months ago by
Narutoouz
Wasn't able to recreate MMLU-Pro benchmarks
5
#5 opened 2 months ago by
zenmagnets
New activity in
GadflyII/MiniMax-M2.1-NVFP4
2 months ago
Request for GLM 4.6V
3
#1 opened 3 months ago by
SFPLM
New activity in
GadflyII/GLM-4.7-Flash-NVFP4
2 months ago
GadflyII/GLM-4.7-Flash-NVFP4
15
#3 opened 2 months ago by
Yu21342
Really appreciate that you ran performance comparison tests with BF16!
3
#2 opened 2 months ago by
zenmagnets