Format dataset types
Browse files
README.md
CHANGED
@@ -35,31 +35,31 @@ Go ahead and axolotl questions!!
|
|
35 |
|
36 |
Have a dataset in one of the following format (JSONL recommended):
|
37 |
|
38 |
-
- alpaca
|
39 |
```json
|
40 |
{"instruction": "...", "input": "...", "output": "..."}
|
41 |
```
|
42 |
-
- jeopardy
|
43 |
```json
|
44 |
{"question": "...", "category": "...", "answer": "..."}
|
45 |
```
|
46 |
-
- oasst
|
47 |
```json
|
48 |
{"INSTRUCTION": "...", "RESPONSE": "..."}
|
49 |
```
|
50 |
-
- gpteacher
|
51 |
```json
|
52 |
{"instruction": "...", "input": "...", "response": "..."}
|
53 |
```
|
54 |
-
- reflection
|
55 |
```json
|
56 |
{"instruction": "...", "input": "...", "output": "...", "reflection": "...", "corrected": "..."}
|
57 |
```
|
58 |
-
- sharegpt
|
59 |
```json
|
60 |
{"conversations": [{"from": "...", "value": "..."}]}
|
61 |
```
|
62 |
-
- completion
|
63 |
```json
|
64 |
{"text": "..."}
|
65 |
```
|
|
|
35 |
|
36 |
Have a dataset in one of the following format (JSONL recommended):
|
37 |
|
38 |
+
- `alpaca`: instruction; input(optional)
|
39 |
```json
|
40 |
{"instruction": "...", "input": "...", "output": "..."}
|
41 |
```
|
42 |
+
- `jeopardy`: question and answer
|
43 |
```json
|
44 |
{"question": "...", "category": "...", "answer": "..."}
|
45 |
```
|
46 |
+
- `oasst`: instruction
|
47 |
```json
|
48 |
{"INSTRUCTION": "...", "RESPONSE": "..."}
|
49 |
```
|
50 |
+
- `gpteacher`: instruction; input(optional)
|
51 |
```json
|
52 |
{"instruction": "...", "input": "...", "response": "..."}
|
53 |
```
|
54 |
+
- `reflection`: instruction with reflect; input(optional)
|
55 |
```json
|
56 |
{"instruction": "...", "input": "...", "output": "...", "reflection": "...", "corrected": "..."}
|
57 |
```
|
58 |
+
- `sharegpt`: conversations
|
59 |
```json
|
60 |
{"conversations": [{"from": "...", "value": "..."}]}
|
61 |
```
|
62 |
+
- `completion`: raw corpus
|
63 |
```json
|
64 |
{"text": "..."}
|
65 |
```
|