THUDM
/

BPO

Text Generation

text-generation-inference

Model card Files Files and versions Community

CCCCCC commited on Nov 20, 2023

Commit

84407e7

•

1 Parent(s): f7d01eb

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ BPO is a black-box alignment technique that differs from training-based methods
 ### Data
 Prompt优化模型由隐含人类偏好特征的prompt优化对训练得到，数据集的详细信息在这里。
-The Prompt Optimization Model is trained on prompt optimization pairs which contain human preference features. Detailed information on the dataset can be found [here](https://huggingface.co/datasets/CCCCCC/BPO).
 ### Backbone Model
 The prompt preference optimizer is built on `Llama-2-7b-chat-hf`.

 ### Data
 Prompt优化模型由隐含人类偏好特征的prompt优化对训练得到，数据集的详细信息在这里。
+The Prompt Optimization Model is trained on prompt optimization pairs which contain human preference features. Detailed information on the dataset can be found [here](https://huggingface.co/datasets/THUDM/BPO).
 ### Backbone Model
 The prompt preference optimizer is built on `Llama-2-7b-chat-hf`.