valerielucro
/
Qwen2-0.5B-GRPO_peft
like
0
Model card
Files
Files and versions
Metrics
Training metrics
Community