BAAI
/
bge-reranker-large
Execution performance issue
#19
by
fffff123
- opened
Jun 5
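The line model(**inputs, return_dict=True).logits.view(-1, ).float() takes a long time to run, about 2 s. What causes this, and how can I optimize it?
Is there a performance difference between the ways of calling the model: FlagEmbedding, Hugging Face transformers, the reranker with the ONNX files, and the reranker with infinity?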
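To narrow down where the ~2 s goes, it may help to time the forward pass on its own, after a couple of warm-up runs, so that model loading and tokenization are not counted. The sketch below is only an illustration under assumptions: `time_call` and `make_forward` are hypothetical helper names, the query/passage pair is a placeholder, and the loading code assumes the usual transformers `AutoTokenizer` / `AutoModelForSequenceClassification` route on CPU.

```python
import statistics
import time


def time_call(fn, warmup=2, repeats=5):
    """Return the median wall-clock seconds of fn() over `repeats` timed runs.

    `warmup` untimed runs come first, because the first calls often
    include one-off setup cost that would skew the measurement.
    """
    for _ in range(warmup):
        fn()
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn()
        samples.append(time.perf_counter() - start)
    return statistics.median(samples)


def make_forward(pairs, model_name="BAAI/bge-reranker-large"):
    """Build a zero-argument callable that runs only the forward pass.

    Hypothetical helper: loading and tokenization happen once here,
    outside the timed region. Imports are local so that time_call
    can be used without torch installed.
    """
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForSequenceClassification.from_pretrained(model_name)
    model.eval()  # inference mode: disables dropout
    inputs = tokenizer(pairs, padding=True, truncation=True,
                       return_tensors="pt", max_length=512)

    def forward():
        # no_grad avoids building the autograd graph during inference
        with torch.no_grad():
            return model(**inputs, return_dict=True).logits.view(-1).float()

    return forward


# Usage (downloads the model, so it is left commented out):
# pairs = [["what is panda?", "The giant panda is a bear species endemic to China."]]
# print(f"median forward time: {time_call(make_forward(pairs)):.3f} s")
# On a GPU, the inputs and model would need to be moved to the device, and
# torch.cuda.synchronize() called inside forward() before returning, since
# CUDA kernels run asynchronously.
```

Running the same measurement with different batch sizes and `max_length` values would show how much of the time scales with input size. It does not by itself answer which of the four calling methods is fastest.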