Bad performance of bge-reranker-v2-gemma compare with bge-reranker-v2-m3
#22
by
shaunxu
- opened
As described in README for better performance it's recommended to use bge-reranker-v2-gemma. But in my case it takes 20 - 30 seconds to compute scores while bge-reranker-v2-m3 only needs 0.5 seconds.
PS, I tested on my MacBook Pro M1 with cpu. Not sure if this is the case.
Hi, Do you know hot wo depoy bge-reranker-v2-m3 on text-embeddings-inference