theblackcat102/electra-large-reward-model

Text Classification


theblackcat102 committed on Jan 1, 2023

Commit 0ce3442 • 1 Parent(s): 97b803e

Update README.md

Files changed (1)

README.md +12 -2


@@ -1,8 +1,18 @@
 ---
-license: mit
 ---
 Reward Model pretrained on openai/webgpt_comparison and humanfeedback summary. Unlike the other electra-large model this model is trained using rank loss with one more datasets.
 On validation dataset the result is much more stable than usual.

 ---
+language:
+  - en
+tags:
+  - webgpt
+  - regression
+  - reward-model
+license: apache-2.0
+datasets:
+  - openai/webgpt_comparisons
+  - Tristan/summarize_from_feedback
+metrics:
+  - accuracy
 ---
 Reward Model pretrained on openai/webgpt_comparison and humanfeedback summary. Unlike the other electra-large model this model is trained using rank loss with one more datasets.
 On validation dataset the result is much more stable than usual.
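
The README says the model is trained with a rank loss but does not give its form. As a rough illustration only, the sketch below shows one common pairwise formulation, `-log(sigmoid(r_chosen - r_rejected))`, averaged over comparison pairs. The function name `pairwise_rank_loss` and the choice of this particular loss are assumptions for illustration, not taken from the repository.

```python
import math


def pairwise_rank_loss(chosen_scores, rejected_scores):
    """Mean of -log(sigmoid(chosen - rejected)) over comparison pairs.

    Hypothetical helper: the README does not state which rank loss is used,
    so this is one common choice, not the author's confirmed method.
    """
    if len(chosen_scores) != len(rejected_scores):
        raise ValueError("score lists must have the same length")
    if not chosen_scores:
        raise ValueError("at least one pair is required")

    total = 0.0
    for chosen, rejected in zip(chosen_scores, rejected_scores):
        margin = chosen - rejected
        # -log(sigmoid(margin)) == softplus(-margin); computed in a
        # numerically stable way to avoid overflow for large |margin|.
        total += max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))
    return total / len(chosen_scores)


# The loss is small when the preferred answer scores higher, large otherwise.
good_ordering = pairwise_rank_loss([2.0, 1.5], [0.0, -0.5])
bad_ordering = pairwise_rank_loss([0.0, -0.5], [2.0, 1.5])
print(good_ordering < bad_ordering)
```

Under this formulation only the difference between the two scores matters, so the model learns a relative ordering of answers rather than an absolute rating.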