traintogpb
/

llama-2-enko-translator-7b-qlora-adapter

text-generation

Model card Files Files and versions Community

traintogpb commited on Mar 14

Commit

6a245f2

•

1 Parent(s): 0d2c0ea

chore: fix model card

Files changed (1) hide show

README.md +0 -5

README.md CHANGED Viewed

@@ -17,12 +17,10 @@ pipeline_tag: translation
 ### Training Dataset
 - [traintogpb/aihub-flores-koen-integrated-sparta-30k](https://huggingface.co/datasets/traintogpb/aihub-flores-koen-integrated-sparta-30k)
 - Can translate in Enlgish-Korean (bi-directional)
 ### Prompt
 - Template:
   ```python
     prompt = f"Translate this from {src_lang} to {tgt_lang}\n### {src_lang}: {src_text}\n### {tgt_lang}:"
@@ -33,7 +31,6 @@ pipeline_tag: translation
 - Issue:
   The tokenizer of the model tokenizes the prompt below in different way with the prompt above.
   Make sure to use the prompt proposed above.
   ```python
     prompt = f"""Translate this from {src_lang} to {tgt_lang}
     ### {src_lang}: {src_text}
@@ -41,7 +38,6 @@ pipeline_tag: translation
     >>> # DO NOT USE this prompt
   ```
   And mind that there is no "space (`_`)" at the end of the prompt.
 ### Training
@@ -52,7 +48,6 @@ pipeline_tag: translation
 ### Usage (IMPORTANT)
 - Should remove the EOS token (`<|endoftext|>`, id=46332) at the end of the prompt.
   ```python
     # MODEL
     plm_name = 'beomi/open-llama-2-ko-7b'

 ### Training Dataset
 - [traintogpb/aihub-flores-koen-integrated-sparta-30k](https://huggingface.co/datasets/traintogpb/aihub-flores-koen-integrated-sparta-30k)
 - Can translate in Enlgish-Korean (bi-directional)
 ### Prompt
 - Template:
   ```python
     prompt = f"Translate this from {src_lang} to {tgt_lang}\n### {src_lang}: {src_text}\n### {tgt_lang}:"
 - Issue:
   The tokenizer of the model tokenizes the prompt below in different way with the prompt above.
   Make sure to use the prompt proposed above.
   ```python
     prompt = f"""Translate this from {src_lang} to {tgt_lang}
     ### {src_lang}: {src_text}
     >>> # DO NOT USE this prompt
   ```
   And mind that there is no "space (`_`)" at the end of the prompt.
 ### Training
 ### Usage (IMPORTANT)
 - Should remove the EOS token (`<|endoftext|>`, id=46332) at the end of the prompt.
   ```python
     # MODEL
     plm_name = 'beomi/open-llama-2-ko-7b'