Update README.md
README.md
@@ -106,6 +106,31 @@ messages = [
 tokens = tokenizer.apply_chat_template(messages, add_generation_prompt=True)
 ```
 
+## Inference using Transformers
+
+```python
+from transformers import AutoTokenizer, AutoModelForCausalLM
+import torch
+
+model_id = "openchat/openchat-3.6-8b-20240522"
+
+tokenizer = AutoTokenizer.from_pretrained(model_id)
+model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16, device_map="auto")
+
+messages = [
+    {"role": "user", "content": "Explain how large language models work in detail."},
+]
+input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt").to(model.device)
+
+outputs = model.generate(input_ids,
+    do_sample=True,
+    temperature=0.6,
+    top_p=0.9,
+)
+response = outputs[0][input_ids.shape[-1]:]
+print(tokenizer.decode(response, skip_special_tokens=True))
+```
+
 <div align="center">
 <h2> Limitations </h2>
 </div>
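
For readers who prefer the higher-level API, the same generation can be run through `transformers.pipeline`. This is a minimal sketch, not part of the commit above; it assumes a recent `transformers` release whose text-generation pipeline accepts chat-style message lists and applies the model's chat template automatically, and the `max_new_tokens=512` value is an illustrative choice.

```python
from transformers import pipeline
import torch

# Build a text-generation pipeline for the same checkpoint.
pipe = pipeline(
    "text-generation",
    model="openchat/openchat-3.6-8b-20240522",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {"role": "user", "content": "Explain how large language models work in detail."},
]

# Passing a message list makes the pipeline apply the chat template itself.
out = pipe(messages, max_new_tokens=512, do_sample=True, temperature=0.6, top_p=0.9)

# With chat input, generated_text holds the whole conversation;
# the last entry is the assistant's reply.
print(out[0]["generated_text"][-1]["content"])
```

Note that the `model.generate` call in the diff leaves output length to the model's default generation config; passing `max_new_tokens` explicitly, as in this sketch, bounds the response length.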