---
library_name: transformers
tags: []
---

- this model was trained to classify whether input text is chosen sentence or rejected text
- the probability (logits after passing softmax function) in last layer of this model can be used to quantify the preferencefrom user input
- fine-tuned [Rakuten/RakutenAI-7B-instruct](https://huggingface.co/Rakuten/RakutenAI-7B-instruct) via [LoRA](https://arxiv.org/abs/2106.09685) using [open-preference-v0.3](https://huggingface.co/datasets/ryota39/open_preference-v0.3)
- trained on bf16 format

## Metric

![image/png](https://cdn-uploads.huggingface.co/production/uploads/651e3f30ca333f3c8df692b8/-V76_Tj2ltNFn1YQWX8BJ.png)

- validation

|accuracy|recall|precision|f1-score|
|:---|:---|:---|:---|
|0.9694|0.9757|0.9636|0.9696|

- test

|accuracy|recall|precision|f1-score|
|:---|:---|:---|:---|
|x|x|x|x|