Update README.md
Browse files
README.md
CHANGED
@@ -1,2 +1,2 @@
|
|
1 |
This is a 3-way classifier judge model fine-tuned on the Chatbot Arena human preference dataset. The base model is llama 13B.
|
2 |
-
More details can be found in the Appendix. F of this [paper](
|
|
|
1 |
This is a 3-way classifier judge model fine-tuned on the Chatbot Arena human preference dataset. The base model is llama 13B.
|
2 |
+
More details can be found in the Appendix. F of this [paper](https://arxiv.org/abs/2306.05685).
|