sauc-abadal-lloret
commited on
Commit
•
cd943c7
1
Parent(s):
e2d0d1e
Update README.md
Browse files
README.md
CHANGED
@@ -27,7 +27,7 @@ In particular, the **ALT-RM** checkpoint collects the feedback by leveraging a [
|
|
27 |
'QUANTILE 3': 'Bad.',
|
28 |
'QUANTILE 4': 'Horrible.'}
|
29 |
```
|
30 |
-
Thus, at inference time, the expected aligned behavior can be attained by conditioning the input with the
|
31 |
|
32 |
**Related Models:** [ALT-Quark](https://huggingface.co/sauc-abadal-lloret/gpt-j-6b-ALT-Quark-tldr).
|
33 |
|
|
|
27 |
'QUANTILE 3': 'Bad.',
|
28 |
'QUANTILE 4': 'Horrible.'}
|
29 |
```
|
30 |
+
Thus, at inference time, the expected aligned behavior can be attained by conditioning the input with the `Excellent.` feedback.
|
31 |
|
32 |
**Related Models:** [ALT-Quark](https://huggingface.co/sauc-abadal-lloret/gpt-j-6b-ALT-Quark-tldr).
|
33 |
|