t-bank-ai/response-quality-classifier-large

Text Classification

egoriya committed on May 31, 2022

Commit b772afd • 1 Parent(s): a02a899

Update README.md

Files changed (1)

README.md +39 -0

@@ -1,3 +1,42 @@
 ---
 license: mit
 ---

+This classification model is based on [sberbank-ai/ruRoberta-large](https://huggingface.co/sberbank-ai/ruRoberta-large).
+It predicts the relevance and specificity of the last message in a dialog, given the preceding context.
+It is pretrained on a corpus of dialog data from social networks and fine-tuned on [tinkoff-ai/context_similarity](https://huggingface.co/tinkoff-ai/context_similarity).
+Performance on the validation split of [tinkoff-ai/context_similarity](https://huggingface.co/tinkoff-ai/context_similarity), using the thresholds that score best on the validation samples:
+<table>
+    <thead>
+        <tr>
+            <td colspan="2">relevance</td>
+            <td colspan="2">specificity</td>
+        </tr>
+    </thead>
+    <tbody>
+        <tr>
+            <td>F0.5</td>
+            <td>ROC AUC</td>
+            <td>F0.5</td>
+            <td>ROC AUC</td>
+        </tr>
+        <tr>
+            <td>0.86</td>
+            <td>0.83</td>
+            <td>0.85</td>
+            <td>0.86</td>
+        </tr>
+    </tbody>
+</table>
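The F0.5 values above are reported at thresholds picked on the validation samples. The README does not describe how that selection was done, so the following is only a minimal sketch of one plausible approach: scan candidate thresholds and keep the one with the highest F0.5. The function names `f_beta` and `best_threshold`, and the toy labels and scores, are hypothetical and not taken from the repository.

```python
def f_beta(y_true, y_pred, beta=0.5):
    """F-beta for binary labels; beta < 1 weights precision above recall."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = (1 + beta**2) * tp + beta**2 * fn + fp
    return (1 + beta**2) * tp / denom if denom else 0.0


def best_threshold(y_true, scores, beta=0.5):
    """Return (threshold, f_beta) for the observed score that maximizes F-beta."""
    best = (None, -1.0)
    for thr in sorted(set(scores)):
        # Predict positive when the score reaches the candidate threshold.
        y_pred = [1 if s >= thr else 0 for s in scores]
        f = f_beta(y_true, y_pred, beta)
        if f > best[1]:
            best = (thr, f)
    return best


# Hypothetical validation labels and model scores, for illustration only.
labels = [1, 0, 1, 1, 0, 0]
scores = [0.9, 0.4, 0.7, 0.3, 0.6, 0.1]
threshold, f_half = best_threshold(labels, scores)
```

Because the threshold is tuned on the same samples the metric is computed on, the resulting F0.5 is an optimistic estimate; ROC AUC does not depend on a threshold and is unaffected by this choice.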
+The model can be loaded as follows:
+```python
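+# pip install transformers
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+tokenizer = AutoTokenizer.from_pretrained("tinkoff-ai/context_similarity")
+# AutoModelForSequenceClassification keeps the classification head;
+# plain AutoModel would load only the encoder.
+model = AutoModelForSequenceClassification.from_pretrained("tinkoff-ai/context_similarity")
+# model.cuda()  # optional: move the model to a GPU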
+```
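Once the model is loaded, its raw outputs still have to be turned into relevance and specificity decisions. The README does not document the output layout, so the sketch below rests on stated assumptions: that the classifier emits two independent logits, ordered relevance then specificity, and that each is passed through a sigmoid and compared with its own threshold. The helper `to_labels`, the logit values, and the thresholds are all hypothetical.

```python
import math


def sigmoid(x):
    """Logistic function mapping a raw logit to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def to_labels(logits, thresholds):
    """Map (relevance, specificity) logits to probabilities and boolean decisions.

    Assumes two independent outputs in this order; this layout is an
    assumption for illustration, not something the README confirms.
    """
    names = ("relevance", "specificity")
    probs = {name: sigmoid(z) for name, z in zip(names, logits)}
    decisions = {name: probs[name] >= thresholds[name] for name in names}
    return probs, decisions


# Hypothetical logits and thresholds, for illustration only.
probs, decisions = to_labels((2.0, -1.0), {"relevance": 0.5, "specificity": 0.5})
```

In practice the thresholds would come from validation data instead of being fixed at 0.5.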