Update README.md
README.md
CHANGED
@@ -17,6 +17,7 @@ datasets:
   - camel-ai/biology
   - camel-ai/physics
   - camel-ai/chemistry
+  - winglian/evals
 
 inference: false
 ---
@@ -25,7 +26,7 @@ inference: false
 
 # Minotaur MPT 7B
 
-Minotaur 7B is an instruct fine-tuned model on top of MPT-7B.
+Minotaur 7B is an instruct fine-tuned model on top of MPT-7B. Minotaur 7B is fine-tuned on only completely open datasets, making this model reproducible by anyone.
 
 Questions, comments, feedback, looking to donate, or want to help? Reach out on our [Discord](https://discord.gg/PugNNHAF5r) or email [wing@openaccessaicollective.org](mailto:wing@openaccessaicollective.org)
 
@@ -36,25 +37,25 @@ Chat only style prompts using `USER:`,`ASSISTANT:`.
 
 Minotaur 7B model is fine-tuned on the following datasets:
 
-- [riddle_sense](https://huggingface.co/datasets/riddle_sense) - instruct augmented
-- hellaswag, updated for detailed explanations w 30K+ rows
-- [gsm8k](https://huggingface.co/datasets/gsm8k) - instruct augmented
 - [WizardLM](https://huggingface.co/datasets/ehartford/WizardLM_alpaca_evol_instruct_70k_unfiltered)
 - [subset of QingyiSi/Alpaca-CoT for roleplay and CoT](https://huggingface.co/QingyiSi/Alpaca-CoT)
 - [GPTeacher-General-Instruct](https://huggingface.co/datasets/teknium/GPTeacher-General-Instruct)
-- ARC-Easy & ARC-Challenge - instruct augmented for detailed responses, derived from the `train` split
-- [hellaswag](https://huggingface.co/datasets/hellaswag) - 5K row subset of instruct augmented for concise responses, derived from the `train` split
 - [metaeval/ScienceQA_text_only](https://huggingface.co/datasets/metaeval/ScienceQA_text_only) - instruct for concise responses
 - [openai/summarize_from_feedback](https://huggingface.co/datasets/openai/summarize_from_feedback) - instruct augmented tl;dr summarization
 - [camel-ai/math](https://huggingface.co/datasets/camel-ai/math)
 - [camel-ai/physics](https://huggingface.co/datasets/camel-ai/physics)
 - [camel-ai/chemistry](https://huggingface.co/datasets/camel-ai/chemistry)
 - [camel-ai/biology](https://huggingface.co/datasets/camel-ai/biology)
-
+- [winglian/evals](https://huggingface.co/datasets/winglian/evals)
+- custom synthetic datasets around misconceptions, in-context QA, jokes, N-task problems, and context-insensitivity
+- ARC-Easy & ARC-Challenge - instruct augmented for detailed responses, derived from the `train` split
+- [hellaswag](https://huggingface.co/datasets/hellaswag) - 30K+ rows, instruct augmented for detailed explanations, derived from the `train` split
+- [riddle_sense](https://huggingface.co/datasets/riddle_sense) - instruct augmented
+- [gsm8k](https://huggingface.co/datasets/gsm8k) - instruct augmented
 
 # Shoutouts
 
-Special thanks to Nanobit for helping with Axolotl
+Special thanks to Nanobit for helping with Axolotl, and to TheBloke for quantizing these models to make them more accessible to all.
 
 # Demo
 
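The hunk context above notes that the model expects chat-only style prompts using `USER:`,`ASSISTANT:` markers. A minimal sketch of building such a prompt follows; the exact spacing and newlines are an assumption, since the README only specifies the two markers, and `format_prompt` is a hypothetical helper name:

```python
# Minimal sketch of the USER:/ASSISTANT: chat-style prompt format.
# Assumption: the user turn is prefixed with "USER: " and the model's
# turn is elicited by a trailing "ASSISTANT:" on the next line.

def format_prompt(user_message: str) -> str:
    """Wrap a single user message in the chat-style prompt format."""
    return f"USER: {user_message}\nASSISTANT:"

prompt = format_prompt("What is the boiling point of water at sea level?")
print(prompt)
```

The resulting string would then be passed to the model (e.g. via a `transformers` pipeline), with the model's generation continuing after the `ASSISTANT:` marker.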