togepi55
/

llm-jp-3-13b-it

PEFT

Safetensors

text-generation-inference

llama

trl

Model card Files Files and versions Community

togepi55 commited on Nov 30, 2024

Commit

03db82b

verified ·

1 Parent(s): f4f24e9

Upload README.md

Browse files

Files changed (1) hide show

README.md +6 -56

README.md CHANGED Viewed

@@ -9,46 +9,22 @@ license: apache-2.0
 ---
 # Model Card for Model ID
-<!-- Provide a quick summary of what the model is/does. -->
-## Model Details
-### Model Description
-<!-- Provide a longer summary of what this model is. -->
 - **Developed by:** togepi55
 - **Funded by :** llm-jp/llm-jp-3-13b
-- **Language(s) (NLP):** en, ja
 - **License:** apache-2.0
-- **Finetuned from model [optional]:** [More Information Needed]
-### Model Sources [optional]
-<!-- Provide the basic links for the model. -->
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
-- **Demo [optional]:** [More Information Needed]
-## Uses
-<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 ### 注意
 プロンプトは形式でのみ学習しています。
 ~~~
-"<s>以下は、タスクを説明する指示です。要求を適切に満たす応答を書きなさい
 ### 指示:
 {instruction}
-### 応答:"
 ~~~
 ### サンプルコード
@@ -77,7 +53,6 @@ model = AutoModelForCausalLM.from_pretrained(
             BASE_MODEL,
             device_map="auto",
             quantization_config=bnb_config,
-            #torch_dtype=torch.bfloat16,
             torch_dtype="auto",
             trust_remote_code=True,
         )
@@ -102,7 +77,6 @@ with torch.no_grad():
               pad_token_id=tokenizer.pad_token_id,
               eos_token_id=tokenizer.eos_token_id,
               do_sample=False,
-              #num_return_sequences=3,
               streamer=streamer,
               repetition_penalty=1.02,
           )
@@ -114,34 +88,10 @@ with torch.no_grad():
-### Direct Use
-<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
-[More Information Needed]
-### Downstream Use [optional]
-<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->
-[More Information Needed]
-### Out-of-Scope Use
-<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
-[More Information Needed]
 ## Bias, Risks, and Limitations
-<!-- This section is meant to convey both technical and sociotechnical limitations. -->
 RLHF，DPOを実施していないため不適切な表現が出力される可能性があります。
-## Training Details
-### Training Data
 指示チューニングデータとして下記のものを利用しました。
 * ichikara-instruction-003-001-1.json
 * ichikara-instruction-003-002-1.json

 ---
 # Model Card for Model ID
 - **Developed by:** togepi55
 - **Funded by :** llm-jp/llm-jp-3-13b
+- **Language(s) (NLP):** English, Japanese
 - **License:** apache-2.0
 ### 注意
 プロンプトは形式でのみ学習しています。
 ~~~
+"""
+<s>以下は、タスクを説明する指示です。要求を適切に満たす応答を書きなさい
 ### 指示:
 {instruction}
+### 応答:
+"""
 ~~~
 ### サンプルコード
             BASE_MODEL,
             device_map="auto",
             quantization_config=bnb_config,
             torch_dtype="auto",
             trust_remote_code=True,
         )
               pad_token_id=tokenizer.pad_token_id,
               eos_token_id=tokenizer.eos_token_id,
               do_sample=False,
               streamer=streamer,
               repetition_penalty=1.02,
           )
 ## Bias, Risks, and Limitations
 RLHF，DPOを実施していないため不適切な表現が出力される可能性があります。
+### Training Details
 指示チューニングデータとして下記のものを利用しました。
 * ichikara-instruction-003-001-1.json
 * ichikara-instruction-003-002-1.json