Commit 714c19e by Teja-Gollapudi
1 Parent(s): a54e47e
Update README.md

README.md CHANGED
@@ -2,6 +2,7 @@
license: cc
datasets:
- VMware/open-instruct-v1-oasst-dolly-hhrlhf
language:
- en
library_name: transformers
@@ -11,11 +12,27 @@ pipeline_tag: text-generation
# VMware/xgen-7b-8k-open-instruct
Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>

-We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k).

The model supports up to <b>8192 tokens </b>

<b> NOTE </b> : The model was trained using the Alpaca prompt template
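
The Alpaca prompt template mentioned in the note wraps each instruction in a fixed preamble followed by `### Instruction:` and `### Response:` markers. The sketch below shows one way to build such a prompt. It assumes the standard Alpaca "no input" wording, which this part of the README does not spell out, and the helper name `build_alpaca_prompt` is hypothetical.

```python
# Assumption: the standard Alpaca "no input" template. The exact wording used
# to train this model is not shown in this part of the README.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:"
)


def build_alpaca_prompt(instruction: str) -> str:
    """Wrap a user instruction in the Alpaca-style prompt (hypothetical helper)."""
    # Surrounding whitespace is stripped so the markers stay on their own lines.
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


if __name__ == "__main__":
    print(build_alpaca_prompt("Explain what instruction tuning is."))
```

The resulting string would be passed to the tokenizer as the prompt, and the model's completion is expected to follow the `### Response:` marker.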

## License
@@ -27,6 +44,10 @@ The model supports up to <b>8192 tokens </b>
## Use in Transformers

```
import os
import torch

@@ -2,6 +2,7 @@
license: cc
datasets:
- VMware/open-instruct-v1-oasst-dolly-hhrlhf
+- conceptofmind/cot_submix_original
language:
- en
library_name: transformers
@@ -11,11 +12,27 @@ pipeline_tag: text-generation
# VMware/xgen-7b-8k-open-instruct
Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>

+We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k). <br>
+
+
+Open-instruct-v1
+- Mosaic/Dolly-HHRLHF + filtered OASST1 - cc by 3.0
+
+Subset of COT SUBMIX (FROM FLAN V2) Zeroshot examples
+- ESNLI - MIT
+- ECQA - permissible
+- Strategy - MIT
+- CREAK - MIT
+- gsm8k - MIT
+- aqua - MIT
+- qasc - Apache 2.0
+
+<br>

The model supports up to <b>8192 tokens </b>

<b> NOTE </b> : The model was trained using the Alpaca prompt template
+<b> NOTE </b> : tiktoken library is required for the tokenizer.


## License
@@ -27,6 +44,10 @@ The model supports up to <b>8192 tokens </b>

## Use in Transformers

+```
+pip install tiktoken
+```
+
```
import os
import torch