Update README.md
README.md
CHANGED
@@ -39,7 +39,7 @@ This model was fine-tuned for instruction following. Instruction-tuned models ar

### How to use

-If you want to use this model for instruction-following, you need to use the same prompt format we used in the fine-tuning process (basically the same format what Meta used in their Llama2 models). **Note: do not use "LlamaTokenizer" from transformers library but always use the AutoTokenizer instead, or use the plain sentencepiece tokenizer.** Here is an example using the instruction-following prompt format, with some generation arguments you can modify for your use:
+If you want to use this model for instruction following, you need to use the same prompt format we used in the fine-tuning process (basically the same format that Meta used in their Llama2 models). **Note: do not use "LlamaTokenizer" from the transformers library; always use AutoTokenizer instead, or use the plain sentencepiece tokenizer.** Here is an example using the instruction-following prompt format with the tokenizer's built-in chat template feature, which also makes it easy to format multi-turn chats, along with some generation arguments you can modify for your use:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM
@@ -47,19 +47,11 @@ from transformers import AutoTokenizer, AutoModelForCausalLM

system_prompt = "Olet tekoälyavustaja. Vastaat aina mahdollisimman avuliaasti. Vastauksesi eivät saa sisältää mitään haitallista, epäeettistä, rasistista, seksististä, vaarallista tai laitonta sisältöä. Jos kysymyksessä ei ole mitään järkeä tai se ei ole asiasisällöltään johdonmukainen, selitä miksi sen sijaan, että vastaisit jotain väärin. Jos et tiedä vastausta kysymykseen, älä kerro väärää tietoa."


-def format_prompt(prompt: str) -> str:
-    prompt = f" [INST] <<SYS>>\n{system_prompt.strip()}\n<</SYS>>\n\n{prompt.strip()} [/INST] "
-    return prompt
-
-
tokenizer = AutoTokenizer.from_pretrained("Finnish-NLP/Ahma-3B-Instruct")
model = AutoModelForCausalLM.from_pretrained("Finnish-NLP/Ahma-3B-Instruct")
model = model.to("cuda")

-# use the
-
-# prompt = format_prompt("Kerro kolme hyötyä, joita pienet avoimen lähdekoodin kielimallit tuovat?")
-# inputs = tokenizer(prompt, return_tensors="pt")
+# use the chat template feature in the tokenizer to format your (multi-turn) inputs

messages = [
    {
@@ -100,6 +92,8 @@ You may experiment with different system prompt instructions too if you like.

### Limitations and bias

+This model was trained only on Finnish text, excluding code, so it should not be used for multilingual or code generation use cases.
+
The training data used for this model contains a lot of content from the internet, which is far from neutral. Therefore, the model can have biased predictions. This bias will also affect all fine-tuned versions of this model.

## Training data
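The hunk context above ends at `messages = [`, so the diff does not show the rest of the new example. Below is a minimal sketch of how the chat-template flow can continue, assuming the standard `transformers` API (`apply_chat_template` plus `generate`); the user question is taken from the old commented-out example, while the generation arguments are illustrative placeholders, not values from the README:

```python
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("Finnish-NLP/Ahma-3B-Instruct")
model = AutoModelForCausalLM.from_pretrained("Finnish-NLP/Ahma-3B-Instruct").to("cuda")

# the Finnish system prompt defined in the README
system_prompt = "Olet tekoälyavustaja. Vastaat aina mahdollisimman avuliaasti. Vastauksesi eivät saa sisältää mitään haitallista, epäeettistä, rasistista, seksististä, vaarallista tai laitonta sisältöä. Jos kysymyksessä ei ole mitään järkeä tai se ei ole asiasisällöltään johdonmukainen, selitä miksi sen sijaan, että vastaisit jotain väärin. Jos et tiedä vastausta kysymykseen, älä kerro väärää tietoa."

# multi-turn chats are just longer lists of these role/content dicts
messages = [
    {"role": "system", "content": system_prompt},
    {"role": "user", "content": "Kerro kolme hyötyä, joita pienet avoimen lähdekoodin kielimallit tuovat?"},
]

# the chat template renders the Llama2-style " [INST] <<SYS>> ... [/INST] " string
# that the removed format_prompt helper used to build by hand
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to("cuda")

# generation arguments are placeholders you can tune for your use case
outputs = model.generate(
    input_ids,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.6,
    top_p=0.9,
    repetition_penalty=1.2,
    pad_token_id=tokenizer.eos_token_id,
)

# decode only the tokens generated after the prompt
print(tokenizer.decode(outputs[0][input_ids.shape[-1]:], skip_special_tokens=True))
```

Compared with the removed `format_prompt` helper, the template keeps the exact fine-tuning format in one place (the tokenizer config), so multi-turn histories stay consistent without manual string assembly.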
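The note in the README also allows "the plain sentencepiece tokenizer" as an alternative to AutoTokenizer. A minimal sketch of that route, assuming the repository ships its sentencepiece model under the conventional `tokenizer.model` filename (an assumption; the diff does not name the file):

```python
import sentencepiece as spm
from huggingface_hub import hf_hub_download

# "tokenizer.model" is the conventional filename in Llama-style repos;
# it is an assumption here, not stated in the diff
sp_model_path = hf_hub_download("Finnish-NLP/Ahma-3B-Instruct", "tokenizer.model")
sp = spm.SentencePieceProcessor(model_file=sp_model_path)

ids = sp.encode("Kerro kolme hyötyä, joita pienet avoimen lähdekoodin kielimallit tuovat?")
print(ids)             # token ids
print(sp.decode(ids))  # round-trips back to the original text
```

Note that this only tokenizes raw text; the Llama2-style `[INST] <<SYS>>` prompt wrapper shown above would still have to be applied around the strings yourself.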