Spaces:
Running
on
Zero
Running
on
Zero
Remove Granite 2B
#1
by
adamelliotfields
- opened
Should only use models that fit in a single safetensors file; must be under 2B.
As of right now, the only alternative non-thinking model is Gemma 2 2B Gemma 3 1B. Thinking models eat up ZeroGPU time, so best saved for API.
adamelliotfields
changed discussion title from
Replace Granite 2B with Qwen 1.5B R1 Distilled
to Remove Granite 2B