argilla
/

roberta-base-reward-model-falcon-dolly

Text Classification

Inference Endpoints

Model card Files Files and versions Community

dvilasuero HF staff commited on May 31, 2023

Commit

cbb1bf8

•

1 Parent(s): c50c6cc

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ tags:
 This is an experimental Reward Model trained with TRL using comparison data from the Dolly v2 dataset and generations from Falcon-7b-instruct.
-For testing purposes, we have followed the **assumption that human written responses (written by Databricks employees) are preferred to those generated by Falcon**. This might not always be the case but you can setup a comparison data collection with Argilla to gather real feedback about preferred responses.
 To use this model for scoring:

 This is an experimental Reward Model trained with TRL using comparison data from the Dolly v2 dataset and generations from Falcon-7b-instruct.
+For testing purposes, we have followed the **assumption that human written responses (written by Databricks employees) are preferred to those generated by Falcon**. This might not always be the case but you can setup a comparison data collection with [Argilla](https://docs.argilla.io/en/latest/guides/llms/conceptual_guides/conceptual_guides.html) to gather real feedback about preferred responses.
 To use this model for scoring: