zake7749
/

gemma-2-2b-it-chinese-kyara-dpo

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

zake7749 commited on Oct 13

Commit

d119373

•

1 Parent(s): 3013f0c

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -5,7 +5,11 @@ language:
 - zh
 - en
 library_name: transformers
-license: mit
 ---
@@ -110,7 +114,7 @@ We have collected a total of 3.6M conversations, approximately 4.51 billion toke
 ### Dataset Construction
-The data construction for Kyara is divided into two parts: English and Chinese. For the English part, we have incorporated multiple high-quality open-source datasets, such as `teknium/OpenHermes-2.5` and `arcee-ai/The-Tome`, and performing semantic deduplication to drop out near-similar examples. As for the Chinese part, the construction follows the process outlined below:
 #### Base Dataset: Knowledge Injection with Retrieval Augmentation

 - zh
 - en
 library_name: transformers
+datasets:
+- zake7749/kyara-chinese-math-sft-s0-30K
+- zake7749/kyara-chinese-preference-rl-dpo-s0-30K
+- zake7749/chinese-sft-stem-zh-hant
+- zake7749/chinese-sft-stem-zh-hans
 ---
 ### Dataset Construction
+The data construction for Kyara is divided into two parts: English and Chinese. For the English part, we have incorporated multiple high-quality open-source datasets, such as [teknium/OpenHermes-2.5](https://huggingface.co/datasets/teknium/OpenHermes-2.5) and [arcee-ai/The-Tome](https://huggingface.co/datasets/arcee-ai/The-Tome), and performing semantic deduplication to drop out near-similar examples. As for the Chinese part, the construction follows the process outlined below:
 #### Base Dataset: Knowledge Injection with Retrieval Augmentation