sap-ai-research/miCSE

TJKlein committed on Nov 18, 2022

Commit de95105 · 1 Parent(s): d04d89e

Update README.md

Files changed (1)

README.md +8 -0

@@ -10,6 +10,14 @@ Language model of the pre-print arXiv paper titled: "_**miCSE**: Mutual Informat
 The **miCSE** language model is trained for sentence similarity computation. Training the model imposes alignment between the attention pattern of different views (embeddings of augmentations) during contrastive learning. Learning sentence embeddings with **miCSE** entails enforcing the syntactic consistency across augmented views for every single sentence, making contrastive self-supervised learning more sample efficient. Sentence representations correspond to the embedding of the _**[CLS]**_ token.
+# Usage
+```shell
+tokenizer = AutoTokenizer.from_pretrained("sap-ai-research/<----Enter Model Name---->")
+model = AutoModelWithLMHead.from_pretrained("sap-ai-research/<----Enter Model Name---->")
+```
 # Benchmark
 Model results on SentEval Benchmark:
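
Note on the added Usage snippet: it is fenced as `shell` but contains Python, it omits the `transformers` import, and it uses `AutoModelWithLMHead`, which is deprecated in recent `transformers` releases. The sketch below is one possible way to load the checkpoint and compare two sentences through their _**[CLS]**_ embeddings, as the model description suggests. It makes several assumptions: the model ID `sap-ai-research/miCSE` is taken from the repository path rather than from the README placeholder, `AutoModel` is swapped in for `AutoModelWithLMHead` because only hidden states are needed, and the helper names `load_encoder`, `embed`, `cosine_similarity`, and `main` are hypothetical.

```python
import math

# Assumption: model ID inferred from the repository path, not from the README,
# which still shows the "<----Enter Model Name---->" placeholder.
MODEL_ID = "sap-ai-research/miCSE"


def load_encoder(model_id=MODEL_ID):
    """Load tokenizer and encoder (hypothetical helper, downloads weights)."""
    # Imported lazily so cosine_similarity works without transformers installed.
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # AutoModel returns hidden states only; no language-modeling head is needed.
    model = AutoModel.from_pretrained(model_id)
    model.eval()
    return tokenizer, model


def embed(sentences, tokenizer, model):
    """Return one [CLS] embedding (a list of floats) per input sentence."""
    import torch

    batch = tokenizer(sentences, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        outputs = model(**batch)
    # [CLS] is the first token of every sequence.
    return outputs.last_hidden_state[:, 0].tolist()


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def main():
    """Example run; requires network access to fetch the checkpoint."""
    tokenizer, model = load_encoder()
    first, second = embed(
        ["A man is playing a guitar.", "Someone plays an instrument."],
        tokenizer,
        model,
    )
    print(cosine_similarity(first, second))
```

Calling `main()` prints a single similarity score for the two example sentences. Whether the published checkpoint expects any pooling beyond the raw _**[CLS]**_ vector is not stated in this diff, so treat the pooling step as an assumption as well.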