yuyijiong committed on
Commit
5a2db8d
·
verified ·
1 Parent(s): aae485e

Update README.md

Files changed (1)
  1. README.md +13 -3
README.md CHANGED
@@ -1,3 +1,13 @@
- ---
- license: apache-2.0
- ---
+ ---
+ license: apache-2.0
+ datasets:
+ - opencsg/UltraFeedback-chinese
+ language:
+
+ - zh
+ base_model:
+ - opencsg/csg-wukong-smoltalk-chinese
+ ---
+ * We apply DPO to ``opencsg/csg-wukong-2b-smoltalk-chinese``, using ``ultrafeedback-chinese-binarized-lowest`` as the DPO dataset.
+ * Training: 2 epochs, learning rate 5e-6, batch size 128.
+ * This model scores higher in evaluation than the one trained on ``ultrafeedback-chinese-binarized``.
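
The README names the method (DPO) and three hyperparameters but not how they fit together. The following is a minimal sketch, not the author's training script: it computes the standard DPO loss for one preference pair and shows one way a global batch size of 128 might be reached through gradient accumulation. The `beta` value, the helper names, and the device layout in the usage note are assumptions, since none of them are given in the README.

```python
import math

# Epochs, learning rate, and batch size come from the README.
# beta is NOT stated there; 0.1 is an assumed, commonly used value.
HPARAMS = {"epochs": 2, "learning_rate": 5e-6, "batch_size": 128, "beta": 0.1}


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=HPARAMS["beta"]):
    """Standard DPO loss for one preference pair, given sequence log-probabilities."""
    # Margin: how much more the policy favors the chosen response over the
    # rejected one, measured relative to the frozen reference model.
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    z = beta * margin
    # -log(sigmoid(z)) == log(1 + exp(-z)), evaluated in a numerically stable form.
    if z >= 0:
        return math.log1p(math.exp(-z))
    return -z + math.log1p(math.exp(z))


def grad_accum_steps(global_batch, per_device_batch, num_devices):
    """Hypothetical helper: accumulation steps needed to reach a global batch size."""
    per_step = per_device_batch * num_devices
    if global_batch % per_step != 0:
        raise ValueError("global batch must be divisible by per_device_batch * num_devices")
    return global_batch // per_step
```

For example, on a hypothetical setup of 8 devices with 4 pairs per device, `grad_accum_steps(HPARAMS["batch_size"], 4, 8)` gives 4 accumulation steps per optimizer update. When the policy and reference assign identical log-probabilities, the margin is zero and the loss equals `log(2)`; it falls below that as the policy shifts toward the chosen response.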