dvilasuero HF staff committed on
Commit
cbb1bf8
1 Parent(s): c50c6cc

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -16,7 +16,7 @@ tags:
 
  This is an experimental Reward Model trained with TRL using comparison data from the Dolly v2 dataset and generations from Falcon-7b-instruct.
 
- For testing purposes, we have followed the **assumption that human written responses (written by Databricks employees) are preferred to those generated by Falcon**. This might not always be the case but you can setup a comparison data collection with Argilla to gather real feedback about preferred responses.
+ For testing purposes, we have followed the **assumption that human written responses (written by Databricks employees) are preferred to those generated by Falcon**. This might not always be the case but you can setup a comparison data collection with [Argilla](https://docs.argilla.io/en/latest/guides/llms/conceptual_guides/conceptual_guides.html) to gather real feedback about preferred responses.
 
  To use this model for scoring:
 