zhichao-geng
commited on
Commit
•
1acd452
1
Parent(s):
5ffdd1f
Update README.md
Browse files
README.md
CHANGED
@@ -28,6 +28,9 @@ Overall, the v2 series of models have better search relevance, efficiency and in
|
|
28 |
| [opensearch-neural-sparse-encoding-doc-v2-mini](https://huggingface.co/opensearch-project/opensearch-neural-sparse-encoding-doc-v2-mini) | ✔️ | 23M | 0.497 | 1.7 |
|
29 |
|
30 |
## Overview
|
|
|
|
|
|
|
31 |
This is a learned sparse retrieval model. It encodes the queries and documents to 30522 dimensional **sparse vectors**. The non-zero dimension index means the corresponding token in the vocabulary, and the weight means the importance of the token.
|
32 |
|
33 |
This model is trained on MS MARCO dataset.
|
|
|
28 |
| [opensearch-neural-sparse-encoding-doc-v2-mini](https://huggingface.co/opensearch-project/opensearch-neural-sparse-encoding-doc-v2-mini) | ✔️ | 23M | 0.497 | 1.7 |
|
29 |
|
30 |
## Overview
|
31 |
+
- **Paper**: [Towards Competitive Search Relevance For Inference-Free Learned Sparse Retrievers](https://arxiv.org/abs/2411.04403)
|
32 |
+
- **Fine-tuning sample**: [opensearch-sparse-model-tuning-sample](https://github.com/zhichao-aws/opensearch-sparse-model-tuning-sample)
|
33 |
+
|
34 |
This is a learned sparse retrieval model. It encodes the queries and documents to 30522 dimensional **sparse vectors**. The non-zero dimension index means the corresponding token in the vocabulary, and the weight means the importance of the token.
|
35 |
|
36 |
This model is trained on MS MARCO dataset.
|