trl-lib
/

llama-7b-se-rm-peft

Transformers

English

trl

rlhf

Model card Files Files and versions Community

lvwerra HF staff commited on Apr 6, 2023

Commit

d598322

•

1 Parent(s): 3520807

Update README.md

Browse files

Files changed (1) hide show

README.md +13 -3

README.md CHANGED Viewed

@@ -1,11 +1,13 @@
 ---
-license: apache-2.0
 language:
 - en
 tags:
 - trl
 - transformers
 - reinforcement-learning
 ---
 ![pull_figure](https://huggingface.co/datasets/trl-internal-testing/example-images/resolve/main/images/stack-llama.png)
@@ -26,11 +28,19 @@ Answer: <Response>
 ```
 ## Intended Uses & Limitations
-**Llama-se-rm** is intended for use in generating scores to questions and responses related to the Stack Exchange dataset. It is suitable for generating answers to questions in the domains covered by the dataset, such as programming, mathematics, and physics. However, the model may not perform well on questions outside these domains or on questions requiring highly specific or technical knowledge.
 ## Limitations and Bias
-The **Llama-se-rm** model inherits limitations and biases from the Llama model and also those contained in the Stack Exchange dataset. The Stack Exchange dataset may contain biases in terms of the topics it covers and the users who contribute to it. It may not include all possible domains, and the quality of answers may vary. Additionally, the model may generate answers that are incorrect or misleading due to biases in the training data or the inherent limitations of the Llama architecture.
 ## BibTeX entry and citation info
 ```bibtex

 ---
+license: bigscience-openrail-m
 language:
 - en
 tags:
 - trl
 - transformers
 - reinforcement-learning
+datasets:
+- lvwerra/stack-exchange-paired
 ---
 ![pull_figure](https://huggingface.co/datasets/trl-internal-testing/example-images/resolve/main/images/stack-llama.png)
 ```
 ## Intended Uses & Limitations
+The **Llama-se-rm** model was trained for long form QA using [Stack Exchange](https://stackexchange.com) data wich is released under a [CC-BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/), and covers topics such as programming, mathematics, and physics.
+It is intended to demonstrate a Large Language Model's ability to follow a target behavior (in this case, generating answers to a question that would have been rated more highly on SE).
+It is not intended to replace human expertise, and answers should be validated through the use of external sources.
+Further research is also needed to attribute model generations to sources in the training data, especially in cases where the model may copy answers from the training data *verbatim*.
 ## Limitations and Bias
+The **Llama-se-rm** model inherits limitations and biases from the Llama model and also those contained in the Stack Exchange dataset.
+In particular, per the [latest developer survey for Stack Overflow](https://survey.stackoverflow.co/2022/),
+which constitutes a significant part of the StackExchange data,
+most users who answered the survey identified themselves as [White or European, men, between 25 and 34 years old, and based in the US (with a significant part of responders from India).](https://survey.stackoverflow.co/2022/#developer-profile-demographics)
+While this demographic information likely varies by topic, disparities between the data contributors and the direct and indirect users of the technology should inform developers in assessing what constitutes an appropriate use case.
+Additionally, the model may generate answers that are incorrect or misleading due to the inherent limitations of the Llama architecture.
 ## BibTeX entry and citation info
 ```bibtex