Skills, datasets, etc for DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan
codezakh
·
AI & ML interests
None yet
Recent Activity
updated
a dataset
10 days ago
mlfoundations-dev/dpo_from_multiple_samples_shortest_numina_aime
published
a dataset
10 days ago
mlfoundations-dev/dpo_from_multiple_samples_shortest_numina_aime
updated
a dataset
11 days ago
mlfoundations-dev/dpo_from_stratos_judged_annotated_rejected_responses