Text Generation
Transformers
PyTorch
Safetensors
English
llama
text-generation-inference
Inference Endpoints
chansurgeplus commited on
Commit
fd847fc
1 Parent(s): 8c2f4c8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -30,17 +30,21 @@ OpenBezoar-HH-RLHF-SFT is an LLM that is built upon the OpenLLaMA 3B v2 architec
30
 
31
  ## Instruction Format
32
 
33
- We follow the typical format for instruction-based prompt templates, with a system prompt followed up by the user prompt. Both begins with a prefix and ends with two newline characters as described below. It is important to utilize this template in order to obtain best responses for instruction fine-tuning related tasks.
34
  ```
35
- ### System: {system}
 
36
 
37
- ### Instruction: {instruction}
 
38
 
39
  ### Response:
40
  ```
41
 
42
  Notice that **no** end-of-sentence (eos) token is being appended.
43
 
 
 
44
  ## Limitations
45
 
46
  - The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops.
 
30
 
31
  ## Instruction Format
32
 
33
+ We follow a modified version of the Alpaca prompt template as shown below. It is important to utilize this template in order to obtain best responses for instruction related tasks.
34
  ```
35
+ ### System:
36
+ Below is an instruction that describes a task, optionally paired with an input that provides further context following that instruction. Write a response that appropriately completes the request.
37
 
38
+ ### Instruction:
39
+ {instruction}
40
 
41
  ### Response:
42
  ```
43
 
44
  Notice that **no** end-of-sentence (eos) token is being appended.
45
 
46
+ *Note: The system prompt shown in the following figure is the one that the model has been trained on most of the time. However, you may attempt to use any other system prompt that is available in the [Orca](https://arxiv.org/abs/2306.02707) scheme.*
47
+
48
  ## Limitations
49
 
50
  - The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops.