Teja-Gollapudi
commited on
Commit
•
480b107
1
Parent(s):
5150184
Update README.md
Browse files
README.md
CHANGED
@@ -12,6 +12,9 @@ pipeline_tag: text-generation
|
|
12 |
# VMware/xgen-7b-8k-open-instruct
|
13 |
Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>
|
14 |
|
|
|
|
|
|
|
15 |
We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k). <br>
|
16 |
|
17 |
|
@@ -31,8 +34,6 @@ Subset of COT SUBMIX (FROM FLAN V2) Zeroshot examples
|
|
31 |
|
32 |
The model supports up to <b>8192 tokens </b>
|
33 |
|
34 |
-
<b> NOTE </b> : The model was trained using the Alpaca prompt template
|
35 |
-
<b> NOTE </b> : tiktoken library is required for the tokenizer. Set trust_remote_code=True when launching the tokenizer.
|
36 |
|
37 |
|
38 |
## License
|
|
|
12 |
# VMware/xgen-7b-8k-open-instruct
|
13 |
Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>
|
14 |
|
15 |
+
<b> NOTE </b> : The model was trained using the Alpaca prompt template <br>
|
16 |
+
<b> NOTE </b> : tiktoken library is required for the tokenizer. Set trust_remote_code=True when launching the tokenizer.<br>
|
17 |
+
|
18 |
We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k). <br>
|
19 |
|
20 |
|
|
|
34 |
|
35 |
The model supports up to <b>8192 tokens </b>
|
36 |
|
|
|
|
|
37 |
|
38 |
|
39 |
## License
|