Update README.md
README.md
CHANGED
@@ -8,9 +8,67 @@ library_name: transformers
tags:
- mergekit
- merge
license: llama3
language:
- ja
---

# Llama-3-Umievo-itr014-Shizuko-8b

This model is an evolutionary merge of four Japanese-capable Llama-3-based models, combined with an evolutionary algorithm: Meta-Llama-3-8B-Instruct, Llama-3-youko-8b-instruct-chatvector, suzume-llama-3-8B-multilingual, and sa-v1-llama3-8b.

We would like to thank the model creators Meta, aixsatoshi, LightBlue, and Shisa-AI for allowing us to use their models for the merge.
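
As a rough illustration of how a mergekit merge is driven, here is a minimal sketch using mergekit's Python API. The YAML recipe below is illustrative only, not the configuration used for this model (the real slice weights appear in the config at the end of this card), and the output path is arbitrary:

```python
import yaml

from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Illustrative two-model linear merge; NOT the actual recipe for Shizuko-8b.
CONFIG_YML = """
merge_method: linear
models:
  - model: meta-llama/Meta-Llama-3-8B-Instruct
    parameters:
      weight: 0.5
  - model: lightblue/suzume-llama-3-8B-multilingual
    parameters:
      weight: 0.5
dtype: bfloat16
"""

merge_config = MergeConfiguration.model_validate(yaml.safe_load(CONFIG_YML))
run_merge(
    merge_config,
    "./merged-model",  # arbitrary output directory
    options=MergeOptions(copy_tokenizer=True),
)
```

The evolutionary part, searching over merge weights with an optimizer and scoring each candidate on a benchmark, is what produces fractional weights like the ones in the full config further down.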

The model scored an average of 3.85 on the ElyzaTasks100 benchmark (the mean of three automatic evaluations by Llama3-70B).

![image/png](https://cdn-uploads.huggingface.co/production/uploads/630420b4eedc089484c853e8/x4BbxfaW_wXPjDfv1Z4lJ.png)

```python
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

model_id = "umiyuki/Llama-3-Umievo-itr014-Shizuko-8b"

# Load the tokenizer and the model in bfloat16, sharded across available devices.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# The system prompt pins the model to Japanese; the user turn asks it to
# "write a story about two girls traveling through a post-apocalyptic world".
messages = [
    {"role": "system", "content": "You must answer all responses in Japanese.あなたは役に立つ誠実な日本人のアシスタントです。あなたは全ての回答に日本語で答えなければならない。"},
    {"role": "user", "content": "二人の少女が終末世界を旅する物語を書いてください。"},
]

input_ids = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt"
).to(model.device)

# Stop on either the default EOS token or Llama-3's end-of-turn token.
terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>")
]

outputs = model.generate(
    input_ids,
    max_new_tokens=256,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.6,
    top_p=0.9,
)

# Decode only the newly generated tokens, skipping the prompt.
response = outputs[0][input_ids.shape[-1]:]
print(tokenizer.decode(response, skip_special_tokens=True))
```
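
The explicit `terminators` list makes generation stop at Llama-3's `<|eot_id|>` end-of-turn token as well as the regular EOS token, and `temperature=0.6` / `top_p=0.9` match the sampling settings used in Meta's own Llama 3 example code.

If the model does not fit in GPU memory at bf16, a 4-bit load is one option. This is a minimal sketch, not part of the original card; it assumes `bitsandbytes` is installed:

```python
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
import torch

# Quantize weights to 4-bit on load; compute still runs in bfloat16.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "umiyuki/Llama-3-Umievo-itr014-Shizuko-8b",
    quantization_config=bnb_config,
    device_map="auto",
)
```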

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
@@ -175,3 +233,7 @@ slices:
      parameters:
        weight: 0.342193342214983
```

Built with Meta Llama 3

Meta Llama 3 is licensed under the Meta Llama 3 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.