Update README.md
Browse files
README.md
CHANGED
@@ -32,3 +32,17 @@ Final checkpoint: RWKV-4-Pile-7B-20221115-8047.pth : Trained on the Pile for 332
|
|
32 |
* PIQA acc 76.06%
|
33 |
* SC2016 acc 73.44%
|
34 |
* Hellaswag acc_norm 65.51%
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
* PIQA acc 76.06%
|
33 |
* SC2016 acc 73.44%
|
34 |
* Hellaswag acc_norm 65.51%
|
35 |
+
|
36 |
+
### Instruct-test models: only useful if you construct your prompt following dataset templates
|
37 |
+
|
38 |
+
RWKV-4-Pile-7B-Instruct-test1
|
39 |
+
instruct-tuned on https://huggingface.co/datasets/bigscience/xP3all/viewer/en/train
|
40 |
+
|
41 |
+
RWKV-4-Pile-7B-Instruct-test2
|
42 |
+
instruct-tuned on https://huggingface.co/datasets/Muennighoff/flan & NIv2
|
43 |
+
|
44 |
+
### Chinese models
|
45 |
+
|
46 |
+
RWKV-4-Pile-7B-EngChn-testNovel-xxx for writing Chinese novels (trained on 200G Chinese novels.)
|
47 |
+
|
48 |
+
RWKV-4-Pile-7B-EngChn-testxxx for Chinese Q&A (trained on 10G Chinese text. only for testing purposes.)
|