Teja-Gollapudi commited on
Commit
714c19e
1 Parent(s): a54e47e

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +22 -1
README.md CHANGED
@@ -2,6 +2,7 @@
2
  license: cc
3
  datasets:
4
  - VMware/open-instruct-v1-oasst-dolly-hhrlhf
 
5
  language:
6
  - en
7
  library_name: transformers
@@ -11,11 +12,27 @@ pipeline_tag: text-generation
11
  # VMware/xgen-7b-8k-open-instruct
12
  Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>
13
 
14
- We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k). (TODO: List out the datasets) <br>
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
15
 
16
  The model supports up to <b>8192 tokens </b>
17
 
18
  <b> NOTE </b> : The model was trained using the Alpaca prompt template
 
19
 
20
 
21
  ## License
@@ -27,6 +44,10 @@ The model supports up to <b>8192 tokens </b>
27
 
28
  ## Use in Transformers
29
 
 
 
 
 
30
  ```
31
  import os
32
  import torch
 
2
  license: cc
3
  datasets:
4
  - VMware/open-instruct-v1-oasst-dolly-hhrlhf
5
+ - conceptofmind/cot_submix_original
6
  language:
7
  - en
8
  library_name: transformers
 
12
  # VMware/xgen-7b-8k-open-instruct
13
  Instruction-tuned version of SalesForce/Xgen-7b-8k-base. The model is open for <b>COMMERCIAL USE</b>. <br>
14
 
15
+ We expanded Open-instruct with additional commercially viable zero-shot COT datasets from Flan v2 (~70k). <br>
16
+
17
+
18
+ Open-instruct-v1
19
+ - Mosaic/Dolly-HHRLHF + filtered OASST1 - cc by 3.0
20
+
21
+ Subset of COT SUBMIX (FROM FLAN V2) Zeroshot examples
22
+ - ESNLI - MIT
23
+ - ECQA - permissible
24
+ - Strategy - MIT
25
+ - CREAK - MIT
26
+ - gsmk8 - MIT
27
+ - aqua - MIT
28
+ - qasc - Apache 2.0
29
+
30
+ <br>
31
 
32
  The model supports up to <b>8192 tokens </b>
33
 
34
  <b> NOTE </b> : The model was trained using the Alpaca prompt template
35
+ <b> NOTE </b> : tiktoken library is required for the tokenizer.
36
 
37
 
38
  ## License
 
44
 
45
  ## Use in Transformers
46
 
47
+ ```
48
+ pip install tiktoken
49
+ ```
50
+
51
  ```
52
  import os
53
  import torch