-
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
Text Generation • 4B • Updated • 13 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
Text Generation • 4B • Updated • 12 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
Text Generation • 4B • Updated • 12 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
Text Generation • 4B • Updated • 10
Yuxiao Qu PRO
CohenQu
AI & ML interests
None yet
Recent Activity
updated
a model
about 2 hours ago
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v2_orchard
updated
a model
about 3 hours ago
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v2_orchard
updated
a model
about 4 hours ago
CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v2_orchard
Organizations
Flexible Ordering
-
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.02
3B • Updated • 10 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.03
3B • Updated • 10 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.04
3B • Updated • 10 -
CohenQu/sft_completion_only_finemath-4plus-flexible-ordering.00.00-Step2000_numina-cot-100k_babel
Text Generation • 4B • Updated • 10
RLAD
-
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.00
Text Generation • 4B • Updated • 13 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.00.01
Text Generation • 4B • Updated • 12 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.00
Text Generation • 4B • Updated • 12 -
CohenQu/Qwen3-4B-Base_HintGen-withSol.01.01
Text Generation • 4B • Updated • 10
Flexible Ordering
-
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.02
3B • Updated • 10 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.03
3B • Updated • 10 -
CohenQu/llama3_3b-finemath-4plus-flexible-ordering.00.04
3B • Updated • 10 -
CohenQu/sft_completion_only_finemath-4plus-flexible-ordering.00.00-Step2000_numina-cot-100k_babel
Text Generation • 4B • Updated • 10
models
389

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-math-4k_with_reasoning_v2_orchard
8B
•
Updated
•
12

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-4k-demo_with_reasoning_v2_orchard
8B
•
Updated
•
3

CohenQu/Qwen2.5-ARC-AGI-4-8-10_3x128_shuffled_tb_32_bs_512_minibs_32_microbs_16_n_16_tp_0.6
3B
•
Updated
•
12

CohenQu/Qwen3-4B-ARC-AGI-4-8-10_tb_64_bs_256_minibs_16_microbs_16_n_16
4B
•
Updated
•
12

CohenQu/Qwen3-1.7B-ARC-AGI-4-8-10_tb_64_bs_256_minibs_16_microbs_16_n_16
2B
•
Updated
•
20

CohenQu/Meta-Llama-3-8B-Instruct_Mixture-of-Thoughts-all-4k-with_reasoning
Text Generation
•
1B
•
Updated
•
36

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_with_reasoning_fixed_DSAI
8B
•
Updated
•
30

CohenQu/LLaDA-8B-Instruct_Mixture-of-Thoughts-all-4k_without_reasoning_fixed_DSAI
8B
•
Updated
•
30

CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_256_minibs_16_microbs_16_n_16
2B
•
Updated
•
13

CohenQu/Qwen3-1.7B-deepscalar_RL_hard_500_verl_bs_512_minibs_16_microbs_16_n_32
2B
•
Updated
•
24
datasets
236
CohenQu/Mixture-of-Thoughts-math-4K-raw
Viewer
•
Updated
•
36.7k
•
11
CohenQu/ARC-AGI-4-8-10_3x128_shuffled
Viewer
•
Updated
•
684
•
65
CohenQu/ARC-AGI-4-8-10_3x128_ordered
Viewer
•
Updated
•
684
•
63
CohenQu/ARC-AGI-4-8-10
Viewer
•
Updated
•
5.3k
•
99
CohenQu/Continue_vs_Terminate.05.eval_prediction_process.08.18
Viewer
•
Updated
•
4.66k
•
128
CohenQu/Continue_vs_Terminate.05.eval_prediction_with_length.08.18
Viewer
•
Updated
•
63k
•
126
CohenQu/Continue_vs_Terminate.05.eval_prediction_process
Viewer
•
Updated
•
6.01k
•
134
CohenQu/Continue_vs_Terminate.05.eval_prediction_with_length
Viewer
•
Updated
•
81k
•
141
CohenQu/ARC-AGI-verl
Viewer
•
Updated
•
5.3k
•
113
CohenQu/ARC-AGI-transduction-rearc-prompt
Viewer
•
Updated
•
99.9k
•
115