Commit
•
cbb1bf8
1
Parent(s):
c50c6cc
Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,7 @@ tags:
|
|
16 |
|
17 |
This is an experimental Reward Model trained with TRL using comparison data from the Dolly v2 dataset and generations from Falcon-7b-instruct.
|
18 |
|
19 |
-
For testing purposes, we have followed the **assumption that human written responses (written by Databricks employees) are preferred to those generated by Falcon**. This might not always be the case but you can setup a comparison data collection with Argilla to gather real feedback about preferred responses.
|
20 |
|
21 |
To use this model for scoring:
|
22 |
|
|
|
16 |
|
17 |
This is an experimental Reward Model trained with TRL using comparison data from the Dolly v2 dataset and generations from Falcon-7b-instruct.
|
18 |
|
19 |
+
For testing purposes, we have followed the **assumption that human written responses (written by Databricks employees) are preferred to those generated by Falcon**. This might not always be the case but you can setup a comparison data collection with [Argilla](https://docs.argilla.io/en/latest/guides/llms/conceptual_guides/conceptual_guides.html) to gather real feedback about preferred responses.
|
20 |
|
21 |
To use this model for scoring:
|
22 |
|