michaelfeil
/

ct2fast-all-MiniLM-L6-v2

@@ -38,7 +38,7 @@ Speedup inference while reducing memory by 2x-4x using int8 inference in C++ on
 quantized version of [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
 ```bash
-pip install hf-hub-ctranslate2>=2.12.0 ctranslate2>=3.16.0
 ```
 ```python
@@ -78,16 +78,20 @@ embeddings = model.encode(
 print(embeddings.shape, embeddings)
 scores = (embeddings @ embeddings.T) * 100
 ```
-Checkpoint compatible to [ctranslate2>=3.16.0](https://github.com/OpenNMT/CTranslate2)
 and [hf-hub-ctranslate2>=2.12.0](https://github.com/michaelfeil/hf-hub-ctranslate2)
 - `compute_type=int8_float16` for `device="cuda"`
 - `compute_type=int8`  for `device="cpu"`
-Converted on 2023-06-19 using
 ```
-ct2-transformers-converter --model sentence-transformers/all-MiniLM-L6-v2 --output_dir ~/tmp-ct2fast-all-MiniLM-L6-v2 --force --copy_files config_sentence_transformers.json tokenizer.json modules.json README.md tokenizer_config.json sentence_bert_config.json data_config.json vocab.txt special_tokens_map.json .gitattributes --trust_remote_code
 ```
 # Licence and other remarks:

 quantized version of [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2)
 ```bash
+pip install hf-hub-ctranslate2>=2.12.0 ctranslate2>=3.17.1
 ```
 ```python
 print(embeddings.shape, embeddings)
 scores = (embeddings @ embeddings.T) * 100
+# Hint: you can also host this code via REST API and
+# via github.com/michaelfeil/infinity
 ```
+Checkpoint compatible to [ctranslate2>=3.17.1](https://github.com/OpenNMT/CTranslate2)
 and [hf-hub-ctranslate2>=2.12.0](https://github.com/michaelfeil/hf-hub-ctranslate2)
 - `compute_type=int8_float16` for `device="cuda"`
 - `compute_type=int8`  for `device="cpu"`
+Converted on 2023-10-13 using
 ```
+LLama-2 -> removed <pad> token.
 ```
 # Licence and other remarks:

model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2abb237beb39bae980a7537a16a1fe5a0f0be2184be1d9f39f755b731a582adc
-size 90857292

 version https://git-lfs.github.com/spec/v1
+oid sha256:8e02198a1a1480129f35fede1751d0406a43e5ea8e7abb618ac58285e974cd6e
+size 45430860