Update README.md
README.md CHANGED
@@ -24,7 +24,7 @@ This model is part of the CroissantLLM initiative, and corresponds to the checkp
 
 https://arxiv.org/abs/2402.00786
 
-For best performance, it should be used with a temperature of
+For best performance, it should be used with a temperature of 0.3 or more, and with the exact template described below:
 
 ```python
 chat = [
@@ -85,7 +85,7 @@ chat = [
 chat_input = tokenizer.apply_chat_template(chat, tokenize=False, add_generation_prompt=True)
 
 inputs = tokenizer(chat_input, return_tensors="pt", add_special_tokens=True).to(model.device)
-tokens = model.generate(**inputs, max_new_tokens=150, do_sample=True, top_p=0.95, top_k=60, temperature=0.
+tokens = model.generate(**inputs, max_new_tokens=150, do_sample=True, top_p=0.95, top_k=60, temperature=0.3)
 print(tokenizer.decode(tokens[0]))
 ```
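The parameter the second hunk pins down, `temperature`, rescales the logits before the softmax that sampling draws from: values near zero concentrate probability on the top token (near-greedy), while larger values such as the recommended 0.3 spread it out. A minimal sketch of that effect, using toy logits and only the standard library (nothing here is CroissantLLM-specific):

```python
import math

def softmax_with_temperature(logits, temperature):
    # Divide logits by the temperature before normalizing: a lower
    # temperature sharpens the distribution, a higher one flattens it.
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]

logits = [2.0, 1.0, 0.5]
sharp = softmax_with_temperature(logits, 0.3)  # close to greedy decoding
mild = softmax_with_temperature(logits, 1.0)   # plain softmax

# Lower temperature puts more mass on the highest-logit token.
print(sharp[0] > mild[0])  # prints True
```

This is why the README pairs `do_sample=True` with an explicit temperature: at 0.3 the model still strongly prefers its top tokens, but retains enough randomness to avoid degenerate repetition.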