VMware
/

xgen-7b-8k-open-instruct

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Teja-Gollapudi commited on Jul 3, 2023

Commit

714c19e

•

1 Parent(s): a54e47e

Update README.md

Files changed (1) hide show

README.md +22 -1

README.md CHANGED Viewed

@@ -2,6 +2,7 @@
 license: cc
 datasets:
 - VMware/open-instruct-v1-oasst-dolly-hhrlhf
 language:
 - en
 library_name: transformers
@@ -11,11 +12,27 @@ pipeline_tag: text-generation
 # VMware/xgen-7b-8k-open-instruct
 Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>
-We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k). (TODO: List out the datasets) <br>
 The model supports up to <b>8192 tokens </b>
 <b> NOTE </b> : The model was trained using the Alpaca prompt template
 ## License
@@ -27,6 +44,10 @@ The model supports up to <b>8192 tokens </b>
 ## Use in Transformers
 ```
 import os
 import torch

 license: cc
 datasets:
 - VMware/open-instruct-v1-oasst-dolly-hhrlhf
+- conceptofmind/cot_submix_original
 language:
 - en
 library_name: transformers
 # VMware/xgen-7b-8k-open-instruct
 Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>
+We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k). <br>
+Open-instruct-v1
+- Mosaic/Dolly-HHRLHF + filtered  OASST1 - cc by 3.0
+Subset of COT SUBMIX (FROM FLAN V2) Zeroshot examples
+- ESNLI -  MIT
+- ECQA  - permissible
+- Strategy  - MIT
+- CREAK  - MIT
+- gsmk8 - MIT
+- aqua  - MIT
+- qasc  - Apache 2.0
+ <br>
 The model supports up to <b>8192 tokens </b>
 <b> NOTE </b> : The model was trained using the Alpaca prompt template
+<b> NOTE </b> : tiktoken library is required for the tokenizer.
 ## License
 ## Use in Transformers
+```
+pip install tiktoken
+```
 ```
 import os
 import torch