Llamafied Models This is a collection of llamafied models - such as Qwen. Minami-su/Qwen1.5-0.5B-Chat_llamafy Text Generation • Updated Mar 18, 2024 • 1 • 5 raincandy-u/Qwen1.5-1.8B_llamafy Text Generation • 2B • Updated Apr 18, 2024 • 2 • 1 raincandy-u/Qwen1.5-4B_llamafy Text Generation • 4B • Updated Apr 18, 2024 • 1 • 1 Minami-su/Qwen1.5-7B-Chat_llamafy Text Generation • Updated Mar 18, 2024 • 2 • 7
Code Datasets layoric/tiny-codes-alpaca Viewer • Updated Oct 8, 2023 • 1.63M • 48 • 2 glaiveai/glaive-code-assistant-v3 Viewer • Updated May 20, 2024 • 950k • 233 • 57 ajibawa-2023/OpenHermes-2.5-Code-290k Updated Feb 19, 2024 • 25 • 7 TIGER-Lab/MathInstruct Viewer • Updated May 15, 2024 • 262k • 4.08k • 294
Papers Improving Small Language Models' Mathematical Reasoning via Mix Thoughts Distillation Paper • 2401.11864 • Published Jan 22, 2024 • 2
Improving Small Language Models' Mathematical Reasoning via Mix Thoughts Distillation Paper • 2401.11864 • Published Jan 22, 2024 • 2
Code DPO Datasets m-a-p/CodeFeedback-Filtered-Instruction Viewer • Updated Feb 26, 2024 • 157k • 1.42k • 170 AlekseyKorshuk/evol-codealpaca-v1-dpo Viewer • Updated Jan 25, 2024 • 39.9k • 41 • 6
Llamafied Models This is a collection of llamafied models - such as Qwen. Minami-su/Qwen1.5-0.5B-Chat_llamafy Text Generation • Updated Mar 18, 2024 • 1 • 5 raincandy-u/Qwen1.5-1.8B_llamafy Text Generation • 2B • Updated Apr 18, 2024 • 2 • 1 raincandy-u/Qwen1.5-4B_llamafy Text Generation • 4B • Updated Apr 18, 2024 • 1 • 1 Minami-su/Qwen1.5-7B-Chat_llamafy Text Generation • Updated Mar 18, 2024 • 2 • 7
Papers Improving Small Language Models' Mathematical Reasoning via Mix Thoughts Distillation Paper • 2401.11864 • Published Jan 22, 2024 • 2
Improving Small Language Models' Mathematical Reasoning via Mix Thoughts Distillation Paper • 2401.11864 • Published Jan 22, 2024 • 2
Code Datasets layoric/tiny-codes-alpaca Viewer • Updated Oct 8, 2023 • 1.63M • 48 • 2 glaiveai/glaive-code-assistant-v3 Viewer • Updated May 20, 2024 • 950k • 233 • 57 ajibawa-2023/OpenHermes-2.5-Code-290k Updated Feb 19, 2024 • 25 • 7 TIGER-Lab/MathInstruct Viewer • Updated May 15, 2024 • 262k • 4.08k • 294
Code DPO Datasets m-a-p/CodeFeedback-Filtered-Instruction Viewer • Updated Feb 26, 2024 • 157k • 1.42k • 170 AlekseyKorshuk/evol-codealpaca-v1-dpo Viewer • Updated Jan 25, 2024 • 39.9k • 41 • 6