Update README.md
Browse files
README.md
CHANGED
|
@@ -11,10 +11,10 @@ tags:
|
|
| 11 |
- llama
|
| 12 |
- think
|
| 13 |
---
|
|
|
|
| 14 |
|
| 15 |

|
| 16 |
|
| 17 |
-
# MiniThink-1B-base
|
| 18 |
|
| 19 |
MiniThink-1B is an experiment to reproduce the "Aha!" moment in AI.
|
| 20 |
Is is trained using a modified version of the method used in the [Unsloth R1 training blog](https://unsloth.ai/blog/r1-reasoning) and the [notebook provided for training LLama 3.1 8B to learn R1 reasoning ](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb).
|
|
|
|
| 11 |
- llama
|
| 12 |
- think
|
| 13 |
---
|
| 14 |
+
# MiniThink-1B-base
|
| 15 |
|
| 16 |

|
| 17 |
|
|
|
|
| 18 |
|
| 19 |
MiniThink-1B is an experiment to reproduce the "Aha!" moment in AI.
|
| 20 |
Is is trained using a modified version of the method used in the [Unsloth R1 training blog](https://unsloth.ai/blog/r1-reasoning) and the [notebook provided for training LLama 3.1 8B to learn R1 reasoning ](https://colab.research.google.com/github/unslothai/notebooks/blob/main/nb/Llama3.1_(8B)-GRPO.ipynb).
|