frankminors123/Chinese-CodeLlama-7B-SFT-V1

frankminors123 committed on Oct 4, 2023

Commit 0c875c4 • 1 Parent(s): ebfa1c1

Create README.md

Files changed (1)

README.md +12 -0

README.md ADDED

@@ -0,0 +1,12 @@

+---
+license: apache-2.0
+datasets:
+- mlabonne/CodeLlama-2-20k
+language:
+- zh
+---
+# Chinese-CodeLlama-7B-SFT
+We performed supervised fine-tuning (SFT) on top of our [Chinese-CodeLlama-7B-PT](https://huggingface.co/frankminors123/Chinese-CodeLlama-7B-PT) model. The training data comes from [CodeLlama-2-20k](https://huggingface.co/datasets/mlabonne/CodeLlama-2-20k), which we translated into Chinese using Google Translate.
+In addition, we designed a Chinese prompt template suited to coding tasks, and we applied `memory efficient attention` during fine-tuning, which saved a substantial amount of GPU memory.
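
Note on `memory efficient attention`: the README does not say which implementation was used, so the following is only a minimal pure-Python sketch of the general idea, not the authors' code. It assumes the usual formulation in which keys and values are processed in chunks while a running maximum, a running normalizer, and a running weighted sum are kept per query, so the full score matrix is never stored. The function names and the `chunk_size` parameter are hypothetical and chosen for illustration.

```python
import math


def naive_attention(q, k, v):
    """Reference version: builds every score for a query before the softmax."""
    scale = 1.0 / math.sqrt(len(q[0]))
    out = []
    for qi in q:
        scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k]
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        out.append([sum(w * vj[d] for w, vj in zip(weights, v)) / total
                    for d in range(len(v[0]))])
    return out


def chunked_attention(q, k, v, chunk_size=2):
    """Illustrative chunked version: only one chunk of scores exists at a time."""
    scale = 1.0 / math.sqrt(len(q[0]))
    dim_v = len(v[0])
    out = []
    for qi in q:
        running_max = -math.inf   # largest score seen so far
        running_sum = 0.0         # softmax normalizer, relative to running_max
        acc = [0.0] * dim_v       # weighted sum of values, relative to running_max
        for start in range(0, len(k), chunk_size):
            k_chunk = k[start:start + chunk_size]
            v_chunk = v[start:start + chunk_size]
            scores = [scale * sum(a * b for a, b in zip(qi, kj)) for kj in k_chunk]
            new_max = max(running_max, max(scores))
            # Rescale earlier partial results to the new maximum.
            # On the first chunk this is exp(-inf) == 0.0, which is harmless.
            correction = math.exp(running_max - new_max)
            running_sum *= correction
            acc = [a * correction for a in acc]
            for s, vj in zip(scores, v_chunk):
                w = math.exp(s - new_max)
                running_sum += w
                for d in range(dim_v):
                    acc[d] += w * vj[d]
            running_max = new_max
        out.append([a / running_sum for a in acc])
    return out


if __name__ == "__main__":
    q = [[1.0, 0.5], [0.2, -0.3]]
    k = [[0.1, 0.2], [0.3, -0.1], [1.5, 0.7], [-0.4, 0.9], [0.0, 0.0]]
    v = [[1.0, 0.0, 2.0], [0.5, 1.5, -1.0], [2.0, 2.0, 2.0],
         [-1.0, 0.3, 0.7], [0.0, 1.0, 0.0]]
    print(naive_attention(q, k, v))
    print(chunked_attention(q, k, v, chunk_size=2))
```

Both functions should agree up to floating-point rounding. The saving comes from the chunked version holding at most `chunk_size` scores per query instead of one score per key. Production implementations apply the same idea on the GPU with fused kernels.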