hamel commited on
Commit
52c83d3
1 Parent(s): d113331

Update rlhf.md (#1237) [skip ci]

Browse files
Files changed (1) hide show
  1. docs/rlhf.md +2 -2
docs/rlhf.md CHANGED
@@ -12,8 +12,8 @@ feedback. Various methods include, but not limited to:
12
 
13
  ### RLHF using Axolotl
14
 
15
- [!IMPORTANT]
16
- This is a BETA feature and many features are not fully implemented. You are encouraged to open new PRs to improve the integration and functionality.
17
 
18
  The various RL training methods are implemented in trl and wrapped via axolotl. Below are various examples with how you can use various preference datasets to train models that use ChatML
19
 
 
12
 
13
  ### RLHF using Axolotl
14
 
15
+ >[!IMPORTANT]
16
+ >This is a BETA feature and many features are not fully implemented. You are encouraged to open new PRs to improve the integration and functionality.
17
 
18
  The various RL training methods are implemented in trl and wrapped via axolotl. Below are various examples with how you can use various preference datasets to train models that use ChatML
19