Jeff Wadsworth
jeffwadsworth
AI & ML interests
Machine Learning
Recent Activity
new activity
9 days ago
Qwen/QwQ-32B:When will you fix the model replies missing</think>\n start tags
new activity
9 days ago
Qwen/QwQ-32B:Doesn't Generate `<think>` tags
liked
a Space
about 1 month ago
Qwen/Qwen2.5-Max-Demo
Organizations
None yet
jeffwadsworth's activity
When will you fix the model replies missing</think>\n start tags
17
#19 opened 9 days ago
by
xldistance
Doesn't Generate `<think>` tags
3
#25 opened 9 days ago
by
bingw5
When using the web version of DeepSeek v3, it keeps repeating responses without stopping.
1
#12 opened 3 months ago
by
Nydaym
Works very well, but I had an issue with the following prompt timing out.
2
#3 opened 6 months ago
by
jeffwadsworth
9.9 vs 9.11 example
11
#19 opened 6 months ago
by
IlyaGusev

a few edits for your model card (sorry I'm a grammar/writing nerd)
2
#16 opened 11 months ago
by
luke-data-leader

Model
25
#5 opened about 1 year ago
by
mrfakename

How to combine split files?
3
#1 opened about 1 year ago
by
deleted
My cpu is only using 50% of its cores.
2
#15 opened about 1 year ago
by
jeffwadsworth
model missing
7
#1 opened over 1 year ago
by
barius

This is a very impressive model. Using the 8bit version.
2
#2 opened over 1 year ago
by
jeffwadsworth
Tokenizer issue?
27
#1 opened over 1 year ago
by
sleepyjoecheated
Unzip problem
1
#15 opened over 1 year ago
by
ShubhangiV
This model looks insanely good for coding ( 73.2 for humanEval )!
18
#1 opened over 1 year ago
by
mirek190
Are the weights updated?
3
#1 opened over 1 year ago
by
krao
Performance of quantified models
1
#3 opened over 1 year ago
by
danielus

Finally got around to testing this model.
#1 opened over 1 year ago
by
jeffwadsworth
This model is amazing.
1
#1 opened almost 2 years ago
by
jeffwadsworth
Prompts
16
#2 opened almost 2 years ago
by
spirilis