weixinchen committed on
Commit
86f2c5f
1 Parent(s): e1e257e

Update README.md

Files changed (1)
  1. README.md +6 -0
README.md CHANGED
@@ -1,6 +1,12 @@
 ---
 library_name: peft
 ---
+
+This is a self-truthified model proposed in the paper [GRATH: Gradual Self-Truthifying for Large Language Models](https://arxiv.org/abs/2401.12292).
+
+Note: DPO is applied to this model once. The DPO reference model is set to the current base model (i.e., the pretrained base model).
+
+
 ## Training procedure
 
 