onnx and openvino - huge files to download

#32
by venkey - opened

few days ago, Bunch of files onnx and openvino files are included and they are getting downloaded as part of model, as the file sizes are huge, its taking more time, is there anyway we can control this behavior, i.e to download only the required files or specific file type.

Sentence Transformers org

Hello!

Indeed, the ONNX/OV files accompany the recent v3.2.0 release, which brings faster inference via various means: https://sbert.net/docs/sentence_transformer/usage/efficiency.html

Only old Sentence Transformers versions (2.2.2 or older I believe) download all files from the repository - all newer versions only download the required files.
My strong recommendation is to use a newer Sentence Transformers version. Note that all newer versions are designed to be backwards compatible with loading and computing embeddings.

  • Tom Aarsen

Sign up or log in to comment