Yiran Wang
yiran-wang3
AI & ML interests
Reinforcement Learning, Self-Driving
Recent Activity
updated
a dataset
about 6 hours ago
reflection-gen/ds_coder_rmsprop_iter4_sppo_hard_new_cn_mining_oj_iter4-binarized_all_pairs
updated
a dataset
about 6 hours ago
reflection-gen/ds_coder_rmsprop_iter4_sppo_hard_new_cn_mining_oj_iter4-full_response_traceback
updated
a dataset
about 6 hours ago
reflection-gen/ds_coder_rmsprop_iter4_sppo_hard_new_cn_mining_oj_iter4-binarized
Organizations
Collections
3
sppo training with original gt (no explaination)
-
yiran-wang3/ds_chat_cosine_original_sppo-GT_ORIGINAL-sppo-0.1-cos-rmsp-1e-7-checkpoint-391
Text Generation • Updated • 5 -
yiran-wang3/ds_chat_cosine_original_sppo-GT_ORIGINAL-sppo-0.1-cos-rmsp-1e-7-checkpoint-1173
Text Generation • Updated • 4 -
yiran-wang3/ds_chat_cosine_original_sppo-GT_ORIGINAL-sppo-0.1-cos-rmsp-1e-7-checkpoint-782
Text Generation • Updated • 6 -
yiran-wang3/ds_chat_cosine_original_sppo-GT_ORIGINAL_MASKED-sppo-0.1-cos-rmsp-1e-7-checkpoint-391
Text Generation • Updated • 5
models
252
yiran-wang3/ds_coder_rmsprop_iter5
Updated
yiran-wang3/sigmoid_ds_chat_rmsprop_iter5
Text Generation
•
Updated
yiran-wang3/ds_coder_rmsprop_iter4
Text Generation
•
Updated
yiran-wang3/ds_coder6.7b_rmsprop_iter5
Text Generation
•
Updated
yiran-wang3/sigmoid_ds_chat_rmsprop_iter4
Text Generation
•
Updated
yiran-wang3/ds_coder_rmsprop_iter3
Text Generation
•
Updated
yiran-wang3/ds_coder6.7b_rmsprop_iter4
Text Generation
•
Updated
yiran-wang3/sigmoid_ds_chat_rmsprop_iter3
Text Generation
•
Updated
yiran-wang3/ds_coder_rmsprop_iter2
Text Generation
•
Updated
yiran-wang3/ds_coder6.7b_rmsprop_iter3
Text Generation
•
Updated
datasets
14
yiran-wang3/original_cn_rl_oj_debug_iter0-full_response_traceback
Viewer
•
Updated
•
2
•
31
yiran-wang3/original_cn_rl_oj_debug_iter0-binarized
Viewer
•
Updated
•
2
•
31
yiran-wang3/cleaned-mining-deepseek-llm-python-binarized-gt-replace
Viewer
•
Updated
•
24.6k
•
39
yiran-wang3/cleaned-mining-codellama-python-base-all-binarized
Viewer
•
Updated
•
26.6k
•
38
yiran-wang3/cleaned-mining-codellama-instruct-base-all-binarized
Viewer
•
Updated
•
26.6k
•
58
yiran-wang3/cleaned-mining-deepseekcoder67-base-all-binarized
Viewer
•
Updated
•
20.1k
•
50
•
1
yiran-wang3/cleaned-mining-deepseekllm-base-all-detailed-score-from-error-binarized
Viewer
•
Updated
•
25k
•
48
yiran-wang3/cleaned-mining-deepseekllm-base-all-binarized
Viewer
•
Updated
•
25k
•
38
yiran-wang3/cleaned-mining-deepseekllm-sft-packing-dpo-binarized
Viewer
•
Updated
•
12.5k
•
49
yiran-wang3/mistral-sft-iter1-eval
Viewer
•
Updated
•
1.25k
•
44