Tristan/qwen3_sft_meta_sft_arc_easy_lr1e-6_wd0.0001_ep10_arc_easy Text Generation • 0.6B • Updated 13 days ago • 8
Tristan/sft_qwen3_lambada_it_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_it_custom_splits Text Generation • 0.6B • Updated 15 days ago • 6
Tristan/sft_qwen3_lambada_es_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_es_custom_splits Text Generation • 0.6B • Updated 15 days ago • 9
Tristan/sft_qwen3_lambada_fr_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_fr_custom_splits Text Generation • 0.6B • Updated 15 days ago • 9
Tristan/sft_qwen3_lambada_de_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_de_custom_splits Text Generation • 0.6B • Updated 15 days ago • 9
Tristan/sft_qwen3_lambada_en_custom_splits_lr1e-6_wd0.0001_ep10_lambada_openai_mt_en_custom_splits Text Generation • 0.6B • Updated 15 days ago • 16
Tristan/sft_qwen3_piqa_custom_splits_lr1e-6_wd0.001_ep10_piqa_custom_splits Text Generation • 0.6B • Updated 15 days ago • 83
Tristan/RedPajama-Data-V2-sample-100B-filtered-shuffled-tokenized-with-token-counts Viewer • Updated May 31, 2024 • 4.16M • 165
Tristan/RedPajama-Data-V2-sample-100B-filtered-for-regression-domains-with-domains Viewer • Updated May 24, 2024 • 4.16M • 122