frankminors123
commited on
Commit
•
0c875c4
1
Parent(s):
ebfa1c1
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,12 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
datasets:
|
4 |
+
- mlabonne/CodeLlama-2-20k
|
5 |
+
language:
|
6 |
+
- zh
|
7 |
+
---
|
8 |
+
# Chinese-CodeLlama-7B-SFT
|
9 |
+
|
10 |
+
We implemented SFT based on our [Chinese-CodeLlama-7B-PT](https://huggingface.co/frankminors123/Chinese-CodeLlama-7B-PT). The dataset comes from [CodeLlama-2-20k](https://huggingface.co/datasets/mlabonne/CodeLlama-2-20k), we used Google Translate to translate it into Chinese.
|
11 |
+
|
12 |
+
In addition, we designed appropriate Chinese prompt template for coding tasks, and during the fine-tuning stage, `memory efficient attention` was applied which save us a lot of GPU memory space.
|