AIdenU/LLAMA-2-13b-ko-Y24-DPO_v2.1

AIdenU committed on Feb 19

Commit 08bf8ba (1 parent: 4845fc9)

Create README.md

Files changed (1)

README.md +26 -0

README.md ADDED

	@@ -0,0 +1,26 @@

+---
+license: apache-2.0
+language:
+- ko
+pipeline_tag: text-generation
+tags:
+- llama2
+---
+from transformers import AutoTokenizer, AutoModelForCausalLM
+model = AutoModelForCausalLM.from_pretrained("AIdenU/LLAMA-2-13b-ko-Y24-DPO_v2.1", device_map="auto")
+tokenizer = AutoTokenizer.from_pretrained("AIdenU/LLAMA-2-13b-ko-Y24-DPO_v2.1", use_fast=True)
+systemPrompt = "당신은 유능한 AI입니다."  # "You are a capable AI."
+prompt = "지렁이도 밟으면 꿈틀하나요?"  # "Does even a worm wriggle when stepped on?" (Korean proverb)
+outputs = model.generate(
+  **tokenizer(
+    f"[INST] <<SYS>>\n{systemPrompt}\n<</SYS>>\n\n{prompt} [/INST] ",
+    return_tensors='pt'
+  ).to(model.device),
+  max_new_tokens=256,
+  temperature=0.2,
+  top_p=1,
+  do_sample=True
+)
+print(tokenizer.decode(outputs[0]))
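
The README snippet builds the Llama-2 chat prompt inline and prints the full decoded sequence, prompt included. A minimal sketch of how the two steps could be factored out follows. The helper names `build_llama2_prompt` and `extract_reply` are hypothetical and not part of this commit, and the assumption that the decoded text still contains the `[/INST]` marker and a trailing `</s>` token is based on how `tokenizer.decode(outputs[0])` is called above, not on anything the model card states.

```python
def build_llama2_prompt(system_prompt: str, user_prompt: str) -> str:
    """Hypothetical helper: reproduce the prompt template used in the README."""
    return f"[INST] <<SYS>>\n{system_prompt}\n<</SYS>>\n\n{user_prompt} [/INST] "


def extract_reply(decoded: str) -> str:
    """Hypothetical helper: keep only the text after the last [/INST] marker.

    Assumes the decoded string may end with the "</s>" end-of-sequence token.
    If no marker is found, the input is returned with whitespace trimmed.
    """
    cleaned = decoded.replace("</s>", "")
    _, found, reply = cleaned.rpartition("[/INST]")
    return reply.strip() if found else cleaned.strip()
```

With these in place, the README's call could pass `build_llama2_prompt(systemPrompt, prompt)` to the tokenizer and print `extract_reply(tokenizer.decode(outputs[0]))` to show only the model's answer.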