Update README.md
Browse files
README.md
CHANGED
@@ -6,7 +6,7 @@ language:
|
|
6 |
|
7 |
# akhooli/arabic-colbertv2-711k-norm
|
8 |
This is a ColBERT V2 model trained on [Arabic mMARCO dataset sample](https://huggingface.co/datasets/akhooli/ar-mmarco-sample) after removing queries with Latin words (711K queries).
|
9 |
-
It is not fully trained, but is good for many tasks especially ranking.
|
10 |
The dataset was normalized before training, so please normalize your query and docs before using it.
|
11 |
```python
|
12 |
from unicodedata import normalize
|
|
|
6 |
|
7 |
# akhooli/arabic-colbertv2-711k-norm
|
8 |
This is a ColBERT V2 model trained on [Arabic mMARCO dataset sample](https://huggingface.co/datasets/akhooli/ar-mmarco-sample) after removing queries with Latin words (711K queries).
|
9 |
+
It is not fully trained (22000 steps only), but is good for many tasks especially ranking and information retrieval (semantic search).
|
10 |
The dataset was normalized before training, so please normalize your query and docs before using it.
|
11 |
```python
|
12 |
from unicodedata import normalize
|