You can just ask things 🗣️ "show me messages in the coding category that are in the top 10% of reward model scores". Download really high-quality instructions from the Llama3.1 405B synthetic dataset 🔥 argilla/magpie-ultra-v1.0
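The quoted request could be sketched in plain Python as a filter over rows. This is only an illustration: the field names `category` and `score`, the helper `top_fraction_in_category`, and the sample rows are hypothetical, not the actual schema or contents of argilla/magpie-ultra-v1.0. It also assumes "top 10%" is measured within the category rather than across the whole dataset.

```python
import math


def top_fraction_in_category(rows, category, fraction=0.10):
    """Return the rows in `category` whose score ranks in the top `fraction`.

    `rows` is a list of dicts with hypothetical keys "category" and "score".
    The fraction is computed within the category, and at least one row is
    returned whenever the category is non-empty.
    """
    in_category = [r for r in rows if r["category"] == category]
    if not in_category:
        return []
    # Highest reward-model score first.
    ranked = sorted(in_category, key=lambda r: r["score"], reverse=True)
    keep = max(1, math.ceil(len(ranked) * fraction))
    return ranked[:keep]


# Made-up rows for illustration only: 20 "coding" rows and 20 "math" rows.
rows = [
    {
        "instruction": f"task {i}",
        "category": "coding" if i % 2 == 0 else "math",
        "score": float(i),
    }
    for i in range(40)
]

best = top_fraction_in_category(rows, "coding")
for row in best:
    print(row["instruction"], row["score"])
```

On the real dataset the same idea would apply once the actual column names are checked on the dataset page.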
Tulu 3 Datasets (Collection): All datasets released with Tulu 3, state-of-the-art open post-training recipes. 32 items, updated 20 days ago.
PixMo (Collection): A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog. 9 items, updated 20 days ago.