Spaces:
Running
on
Zero
Running
on
Zero
add generation prompt
Browse files
Let's not let the AI carry the entirety of the prompt, including the generation prompt; some things are better set manually to avoid extreme cases.
app.py
CHANGED
@@ -77,7 +77,7 @@ def chat_llama3_8b(message: str,
|
|
77 |
conversation.extend([{"role": "user", "content": user}, {"role": "assistant", "content": assistant}])
|
78 |
conversation.append({"role": "user", "content": message})
|
79 |
|
80 |
-
input_ids = tokenizer.apply_chat_template(conversation, return_tensors="pt").to(model.device)
|
81 |
|
82 |
streamer = TextIteratorStreamer(tokenizer, timeout=10.0, skip_prompt=True, skip_special_tokens=True)
|
83 |
|
|
|
77 |
conversation.extend([{"role": "user", "content": user}, {"role": "assistant", "content": assistant}])
|
78 |
conversation.append({"role": "user", "content": message})
|
79 |
|
80 |
+
input_ids = tokenizer.apply_chat_template(conversation,add_generation_prompt=True, return_tensors="pt").to(model.device)
|
81 |
|
82 |
streamer = TextIteratorStreamer(tokenizer, timeout=10.0, skip_prompt=True, skip_special_tokens=True)
|
83 |
|