Text Generation
Transformers
PyTorch
Safetensors
English
llama
text-generation-inference
Inference Endpoints
chansurgeplus committed · verified
Commit 2f53cad · 1 Parent(s): fd847fc

Added usage

Files changed (1):
1. README.md +33 -0
README.md CHANGED
@@ -45,6 +45,39 @@ Notice that **no** end-of-sentence (eos) token is being appended.
 
 *Note: The system prompt shown in the following figure is the one that the model has been trained on most of the time. However, you may attempt to use any other system prompt that is available in the [Orca](https://arxiv.org/abs/2306.02707) scheme.*
 
+## Usage
+
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+checkpoint = "SurgeGlobal/OpenBezoar-SFT"
+
+tokenizer = AutoTokenizer.from_pretrained(checkpoint)
+
+model = AutoModelForCausalLM.from_pretrained(
+    checkpoint,
+    load_in_4bit=True,  # optional: 4-bit quantization for low-resource environments (requires bitsandbytes)
+    device_map="auto"
+)
+
+prompt = """### System:
+Below is an instruction that describes a task, optionally paired with an input that provides further context following that instruction. Write a response that appropriately completes the request.
+
+### Instruction:
+{instruction}
+
+### Response:""".format(
+    instruction="What is the world state in the year 1597?"
+)
+
+inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
+
+outputs = model.generate(**inputs, max_new_tokens=1024, do_sample=True)
+
+print(tokenizer.decode(outputs[0]))
+```
+
 ## Limitations
 
 - The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops.
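
`tokenizer.decode(outputs[0])` in the snippet above returns the prompt together with the completion. A minimal follow-on sketch, assuming the same `inputs` and `outputs` variables from that snippet, that decodes only the newly generated tokens:

```python
# Slice off the prompt tokens so only the model's completion is decoded;
# skip_special_tokens=True drops markers such as the eos token.
prompt_length = inputs["input_ids"].shape[1]
completion = tokenizer.decode(outputs[0][prompt_length:], skip_special_tokens=True)
print(completion)
```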
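
The note above suggests that any system prompt from the [Orca](https://arxiv.org/abs/2306.02707) scheme may be tried. A minimal sketch of swapping the system message while keeping the same template; the helper function and the sample system message are illustrative assumptions, not part of the model card:

```python
# Illustrative helper: renders the same "### System / ### Instruction /
# ### Response" template with a caller-supplied system message.
def build_prompt(instruction: str, system: str) -> str:
    return (
        "### System:\n"
        f"{system}\n\n"
        "### Instruction:\n"
        f"{instruction}\n\n"
        "### Response:"
    )

# An Orca-style system message (example only; any message from the scheme may be used).
prompt = build_prompt(
    instruction="Explain why the sky appears blue.",
    system="You are an AI assistant. Provide a detailed answer so the user does not need to search elsewhere to understand it.",
)
```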