edbeeching
/

DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Model card Files Files and versions

DeepScaler-DeepSeek-R1-Distill-Qwen-1.5B-GRPO

3.57 GB

1 contributor

History: 2 commits

edbeeching's picture

edbeeching HF Staff

Training in progress, step 50

f13bdd8 verified 10 months ago