norabelrose
committed on
Commit
•
bcdc53a
1
Parent(s):
1c797af
Update README.md
README.md
CHANGED
@@ -1,7 +1,7 @@
 ---
 license: apache-2.0
 ---
-
+This is a 2.8b parameter RNN language model using the [Mamba](https://arxiv.org/abs/2312.00752) architecture, pretrained on the Pile corpus and fine-tuned on Nous Research's OpenHermes 2.5 chat dataset.
 
 Model code: https://github.com/state-spaces/mamba/tree/main
 
@@ -10,3 +10,44 @@ To load the model, follow the installation instruction in the code repo, and the
 from mamba_ssm.models.mixer_seq_simple import MambaLMHeadModel
 model = MambaLMHeadModel.from_pretrained("EleutherAI/Hermes-mamba-2.8b")
 ```
+
+The chat format is identical to the default HuggingFace format, **EXCEPT** that we replace the `<|im_start|>` and `<|im_end|>` special tokens with the unused ASCII codes `"\x16"` and `"\x17"` respectively. This was done for compatibility with our [RWKV model](https://huggingface.co/EleutherAI/Hermes-RWKV-v5-7B).
+
+An example interaction:
+```
+\x16human
+Write a poem about the beauty of mathematics in the style of Emily Dickinson.\x17
+\x16assistant
+In the heart of Numbers, so precise,
+A Beauty dwells - both cool and nice.
+An elegance that sings in sums,
+And dances to algebraic drums.
+
+Addition's grace, subtraction's might,
+Multiplication's sheer delight,
+Division's firm yet gentle hand,
+All part of this enchanted land.
+
+Infinity, a lover bold,
+To whom the stars their secrets told,
+While Zero, cloaked in mystery,
+Unlocks each mathematical decree.
+
+Fractals bloom like flowers wild,
+Geometry, Nature's favored child,
+Calculus, with steady gaze,
+Maps out life's intricate maze.
+
+Pi, an endless serenade,
+In circles' arms forever laid,
+And prime numbers,
+standing lone,
+Make the universe their own.
+
+Each equation, theorem scribed,
+In
+parchment old or computer imbibed,
+Is but a whisper from the skies,
+A cosmic truth in disguise.\x17
+```
+In the above, newlines are printed without escaping, while ASCII codes are escaped for readability.
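To make the chat format added in this commit concrete, here is a minimal Python sketch that builds a prompt string using `"\x16"` and `"\x17"` in place of `<|im_start|>` and `<|im_end|>`. The helper name `format_chat` and the constants `IM_START` and `IM_END` are hypothetical and not part of the model repository. The role names `human` and `assistant` are taken from the example interaction, and the newline between turns is an assumption inferred from that example.

```python
# Hypothetical helper illustrating the chat format described in the README.
# Nothing here comes from the mamba_ssm package or the model repository.

IM_START = "\x16"  # replaces <|im_start|>
IM_END = "\x17"    # replaces <|im_end|>


def format_chat(messages, add_generation_prompt=True):
    """Render (role, content) pairs into a single prompt string.

    Each turn is rendered as: IM_START + role + newline + content + IM_END,
    followed by a newline (assumed from the README example).
    """
    parts = []
    for role, content in messages:
        parts.append(f"{IM_START}{role}\n{content}{IM_END}\n")
    if add_generation_prompt:
        # Open an assistant turn so the model continues from here.
        parts.append(f"{IM_START}assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    prompt = format_chat([("human", "Write a haiku about state space models.")])
    # repr() shows the control characters in escaped form.
    print(repr(prompt))
```

When decoding a completion, generation would presumably be cut at the first `"\x17"`, since that character marks the end of the assistant turn in this format.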