Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
29.1
TFLOPS
134
7
29
Steve Li
CHNtentes
Follow
ltim's profile picture
Mi6paulino's profile picture
badunc's profile picture
5 followers
·
16 following
CHNtentes
AI & ML interests
None yet
Recent Activity
new
activity
about 12 hours ago
zai-org/GLM-4.6-FP8:
Missing MTP?
upvoted
a
paper
1 day ago
Less is More: Recursive Reasoning with Tiny Networks
new
activity
1 day ago
unsloth/GLM-4.5-Air-GGUF:
What speed do you get at Q8 on AMD Ryzen™ AI Max+ 395
View all activity
Organizations
None yet
CHNtentes
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
zai-org/GLM-4.6-FP8
about 12 hours ago
Missing MTP?
4
#1 opened 8 days ago by
jondurbin
upvoted
a
paper
1 day ago
Less is More: Recursive Reasoning with Tiny Networks
Paper
•
2510.04871
•
Published
5 days ago
•
280
New activity in
unsloth/GLM-4.5-Air-GGUF
1 day ago
What speed do you get at Q8 on AMD Ryzen™ AI Max+ 395
4
#14 opened 5 days ago by
akierum
New activity in
Qwen/Qwen3-Omni-30B-A3B-Instruct
14 days ago
Quantization issues
4
#17 opened 15 days ago by
stev236
New activity in
Qwen/Qwen3-Omni-30B-A3B-Instruct
18 days ago
Update README.md
1
#9 opened 18 days ago by
CHNtentes
upvoted
2 collections
18 days ago
Qwen3-VL
Collection
9 items
•
Updated
9 days ago
•
183
Qwen3Guard
Collection
7 items
•
Updated
11 days ago
•
49
upvoted
a
collection
19 days ago
Qwen3-Omni
Collection
6 items
•
Updated
3 days ago
•
153
New activity in
unsloth/Qwen3-Next-80B-A3B-Instruct-bnb-4bit
25 days ago
Error when using vLLM
➕
11
5
#2 opened 28 days ago by
sheliak
New activity in
LLM360/K2-Think
26 days ago
Evaluation sloppiness / benchmark cheating?
👍
2
1
#9 opened 26 days ago by
jaens
New activity in
unsloth/GLM-4.5-Air-GGUF
28 days ago
Corrected jinja template with tool Support works with PR llama.cpp/pull/15186
❤️
2
16
#9 opened 2 months ago by
xbruce22
New activity in
cpatonn/Qwen3-Next-80B-A3B-Instruct-AWQ-4bit
29 days ago
Error when running in VLLM
👍
2
18
#1 opened 29 days ago by
d8rt8v
New activity in
Qwen/Qwen3-Next-80B-A3B-Instruct
29 days ago
How much GPU memory is needed for local deployment?
13
#7 opened 30 days ago by
XuehangCang
Plan for AWQ?
➕
24
2
#8 opened 30 days ago by
hyunw55
liked
a model
about 1 month ago
IndexTeam/IndexTTS-2
Updated
Sep 8
•
25.3k
•
383
New activity in
moonshotai/Kimi-K2-Instruct-0905
about 1 month ago
Considering a distilled version of 80B parameters
➕
1
5
#2 opened about 1 month ago by
snapo
liked
a model
about 1 month ago
moonshotai/Kimi-K2-Instruct-0905
Text Generation
•
Updated
1 day ago
•
20.8k
•
•
473
New activity in
unsloth/DeepSeek-V3.1-GGUF
about 2 months ago
changed tool call format?
1
#2 opened about 2 months ago by
CHNtentes
Thanks!
❤️
2
5
#1 opened about 2 months ago by
segmond
New activity in
deepseek-ai/DeepSeek-V3.1
about 2 months ago
Context length: is it 128K (as mentioned in the model card) or 160K (as specified in config.json)?
1
#17 opened about 2 months ago by
Lissanro
Load more