Dan Voyce
the1dv
AI & ML interests
None yet
Organizations
None yet
Rope Scaling pre-applied?
6
#1 opened 8 months ago by the1dv
ValueError: There is no module or parameter named 'lm_head.biases' in Qwen3ForCausalLM
2
#1 opened 8 months ago by the1dv