license: apache-2.0
datasets:
  - mlabonne/CodeLlama-2-20k
language:
  - zh

Chinese-CodeLlama-7B-SFT

We performed supervised fine-tuning (SFT) on top of our Chinese-CodeLlama-7B-PT model. The training data is CodeLlama-2-20k, which we translated into Chinese with Google Translate.

In addition, we designed a Chinese prompt template suited to coding tasks. During fine-tuning we applied memory-efficient attention, which saved a significant amount of GPU memory.

The Chinese prompt template is shown below:

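'''
"下面是描述一项任务的指令,并且与一则输入配对用来提供更多的上下文。请给出尽可能满足请求的回答.\n"
"### 指令:\n{instruction}\n### 输入:\n{input}\n### 回答:\n"
'''

In English, the template reads roughly: "Below is an instruction that describes a task, paired with an input that provides further context. Give a response that satisfies the request as fully as possible." It is followed by three sections: `### 指令:` (Instruction), `### 输入:` (Input), and `### 回答:` (Response). The Chinese text should be passed to the model unchanged.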
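As a minimal sketch of how the template might be filled in before tokenization, the snippet below joins the two string fragments and substitutes the `{instruction}` and `{input}` placeholders. The names `PROMPT_TEMPLATE` and `build_prompt` are hypothetical and not part of this repository. Passing an empty string when a task has no input is an assumption; the model card does not say how that case was handled during training.

```python
# Template text copied verbatim from this model card.
# PROMPT_TEMPLATE and build_prompt are hypothetical names used for illustration.
PROMPT_TEMPLATE = (
    "下面是描述一项任务的指令,并且与一则输入配对用来提供更多的上下文。请给出尽可能满足请求的回答.\n"
    "### 指令:\n{instruction}\n### 输入:\n{input}\n### 回答:\n"
)


def build_prompt(instruction: str, input_text: str = "") -> str:
    """Fill the template with an instruction and an optional input.

    Assumption: an empty input is passed as an empty string; the model card
    does not document a separate no-input template.
    """
    return PROMPT_TEMPLATE.format(instruction=instruction, input=input_text)


if __name__ == "__main__":
    # Example request: "Write a Python function that adds two integers."
    print(build_prompt("用Python写一个函数,计算两个整数的和。", "a = 1, b = 2"))
```

The resulting string ends with `### 回答:\n`, so the model's generated text is the answer that follows that marker.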