Post
2963
In the vector search setup, we normally combine a fast embedding model and an accurate but slow reranker model.
The newly released @jinaai rerankers are small in size and almost as accurate as our base reranker. This means given a time constraint, it can scoring more candidate documents from embedding models and have a better chance to feed LLM the correct context for RAG generation.
These models are available on Huggingface and has been integrated into the latest SentenceTransformers 2.7.0. Check it out!
jinaai/jina-reranker-v1-turbo-en
jinaai/jina-reranker-v1-tiny-en
The newly released @jinaai rerankers are small in size and almost as accurate as our base reranker. This means given a time constraint, it can scoring more candidate documents from embedding models and have a better chance to feed LLM the correct context for RAG generation.
These models are available on Huggingface and has been integrated into the latest SentenceTransformers 2.7.0. Check it out!
jinaai/jina-reranker-v1-turbo-en
jinaai/jina-reranker-v1-tiny-en