---
language: []
library_name: sentence-transformers
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:98928
- loss:MultipleNegativesRankingLoss
base_model: sentence-transformers/multi-qa-mpnet-base-dot-v1
datasets: []
metrics:
- cosine_accuracy@1
- cosine_accuracy@3
- cosine_accuracy@5
- cosine_accuracy@10
- cosine_precision@1
- cosine_precision@3
- cosine_precision@5
- cosine_precision@10
- cosine_recall@1
- cosine_recall@3
- cosine_recall@5
- cosine_recall@10
- cosine_ndcg@10
- cosine_mrr@10
- cosine_map@100
- dot_accuracy@1
- dot_accuracy@3
- dot_accuracy@5
- dot_accuracy@10
- dot_precision@1
- dot_precision@3
- dot_precision@5
- dot_precision@10
- dot_recall@1
- dot_recall@3
- dot_recall@5
- dot_recall@10
- dot_ndcg@10
- dot_mrr@10
- dot_map@100
widget:
- source_sentence: Abnormal wakefulness, loss of self-awareness, involuntary trance
sentences:
- 'See also: Neonatal lupus erythematosus'
- 'Scalp-ear-nipple syndrome, as its name suggests, is a condition characterized
by abnormalities of the scalp, ears, and nipples. Less frequently, affected individuals
have problems affecting other parts of the body. The features of this disorder
can vary even within the same family.
Babies with scalp-ear-nipple syndrome are born with a condition called aplasia
cutis congenita, which involves patchy abnormal areas (lesions) on the scalp.
These lesions are firm, raised, hairless nodules that resemble open wounds or
ulcers at birth, but that heal during childhood.
The external ears of people with scalp-ear-nipple syndrome may be small, cup-shaped,
folded over, or otherwise mildly misshapen. Hearing is generally normal. Affected
individuals also have nipples that are underdeveloped (hypothelia) or absent (athelia).
In some cases the underlying breast tissue is absent as well (amastia).'
- "Abnormal state of wakefulness or altered state of consciousness\n\nFor other\
\ uses, see Trance (disambiguation).\n\nThis article needs additional citations\
\ for verification. Please help improve this article by adding citations to reliable\
\ sources. Unsourced material may be challenged and removed. \nFind sources:\
\ \"Trance\" – news · newspapers · books · scholar · JSTOR (August 2010) (Learn\
\ how and when to remove this template message) \n \nDissociative trance \n\
The Oracle at Delphi was famous for her divinatory trances throughout the ancient\
\ Mediterranean world. Oil painting, John Collier, 1891 \nSpecialtyPsychiatry\
\ \n \nTrance is an abnormal state of wakefulness in which a person is not self-aware\
\ and is either altogether unresponsive to external stimuli (but nevertheless\
\ capable of pursuing and realizing an aim) or is selectively responsive in following\
\ the directions of the person (if any) who has induced the trance. Trance states\
\ may occur involuntarily and unbidden."
- source_sentence: respiratory infections, recurrent infections, primary immunodeficiency
sentences:
- 'A number sign (#) is used with this entry because of evidence that autosomal
dominant common variable immunodeficiency-12 (CVID12) is caused by heterozygous
mutation in the NFKB1 gene (164011) on chromosome 4q24.
Description
Common variable immunodeficiency-12 is an autosomal dominant primary immunodeficiency
characterized by recurrent infections, mainly respiratory, associated with hypogammaglobulinemia.
The disorder shows a highly variable age at onset and highly variable disease
severity, even within the same family. Some patients have features of autoimmunity
(summary by Fliegauf et al., 2015).
For a general description and a discussion of genetic heterogeneity of common
variable immunodeficiency, see CVID1 (607594).
Clinical Features'
- Kyasanura forest disease (KFD), caused by the KFD virus, is an arbovirus characterized
by an initial fever, headache and myalgia that can progress to a hemorrhagic disease
and that in some cases is followed by a second phase characterized by neurological
manifestations.
- 'A number sign (#) is used with this entry because of evidence that X-linked syndromic
mental retardation-33 (MRXS33) is caused by mutation in the TAF1 gene (313650)
on chromosome Xq13.
Description
X-linked syndromic mental retardation-33 is an X-linked recessive neurodevelopmental
disorder characterized by delayed psychomotor development, intellectual disability,
and characteristic facial features (summary by O''Rawe et al., 2015).
Clinical Features'
- source_sentence: Common variable immunodeficiency, recurrent infections, impaired
antibody production
sentences:
- 'A number sign (#) is used with this entry because this form of common variable
immunodeficiency (CVID), referred to here as CVID5, is caused by homozygous mutation
in the CD20 gene (MS4A1; 112210) on chromosome 11q13.
For a general description and a discussion of genetic heterogeneity of common
variable immunodeficiency, see CVID1 (607594).
Clinical Features'
- 'A number sign (#) is used with this entry because of evidence that the Stanescu
type of spondyloepiphyseal dysplasia (SEDSTN) is caused by heterozygous mutation
in the COL2A1 gene (120140) on chromosome 12q13.
Description'
- '## Description
Macular dystrophies are inherited retinal dystrophies in which various forms of
deposits, pigmentary changes, and atrophic lesions are observed in the macula
lutea, the cone-rich region of the central retina. Vitelliform macular dystrophies
(VMDs) form a subset of macular dystrophies characterized by round yellow deposits,
usually at the center of the macula and containing lipofuscin, a chemically heterogeneous
pigment visualized by autofluorescence imaging of the fundus (summary by Manes
et al., 2013). In contrast to typical VMD (see 153700), patients with atypical
VMD may exhibit normal electrooculography, even when severe loss of vision is
present, and fluorescein angiography is thus the most reliable test for identifying
affected individuals (Hittner et al., 1984).
### Genetic Heterogeneity of Vitelliform Macular Dystrophy'
- source_sentence: Growth retardation, hearing impairment, joint hypermobility, sacral
caudal remnant
sentences:
- 'A number sign (#) is used with this entry because Bruck syndrome-2 (BRKS2) is
caused by homozygous mutation in the PLOD2 gene (601865), which encodes telopeptide
lysyl hydroxylase, on chromosome 3q24.
For a phenotypic description and a discussion of genetic heterogeneity of Bruck
syndrome, see Bruck syndrome-1 (259450).
Clinical Features
Ha-Vinh et al. (2004) described a child with Bruck syndrome who was the offspring
of healthy nonconsanguineous Turkish parents. At birth, pterygia were present
at the left elbow and at both knees, and extension of these joints was limited.
Contractures were also present at the wrists, and there were bilateral clubfeet.
Bilateral inguinal hernias were present. A fracture of the left arm was recognized
immediately after birth, and the boy had 2 more fractures in the first 3 months
of life. His urine contained high levels of hydroxyproline but low levels of collagen
crosslinks degradation products.'
- '## Summary
### Clinical characteristics.
Thrombocytopenia absent radius (TAR) syndrome is characterized by bilateral absence
of the radii with the presence of both thumbs and thrombocytopenia (<50 platelets/nL)
that is generally transient. Thrombocytopenia may be congenital or may develop
within the first few weeks to months of life; in general, thrombocytopenic episodes
decrease with age. Cow''s milk allergy is common and can be associated with exacerbation
of thrombocytopenia. Other anomalies of the skeleton (upper and lower limbs, ribs,
and vertebrae), heart, and genitourinary system (renal anomalies and agenesis
of uterus, cervix, and upper part of the vagina) can occur.
### Diagnosis/testing.'
- A rare multiple congenital anomalies/dysmorphic syndrome characterized by global
developmental delay, intellectual disability, growth retardation, hearing impairment,
characteristic facial dysmorphology (including prominent supraorbital ridges,
downslanting palpebral fissures, deep-set eyes, long face, sagging cheeks, anteverted
nares, and pointed chin), generalized hypotonia, joint hypermobility, gluteal
crease with sacral caudal remnant and sacral dimple, and variable neurological
features. Various ophthalmic, cutaneous, musculoskeletal, gastrointestinal, and
cardiovascular anomalies have also been described.
- source_sentence: ear malformations, nipple abnormalities, dental anomalies
sentences:
- "This article is an orphan, as no other articles link to it. Please introduce\
\ links to this page from related articles; try the Find link tool for suggestions.\
\ (July 2016) \n \nInguinal lymphadenopathy \nInguinal lymphadenopathy \n\
\ \nInguinal lymphadenopathy causes swollen lymph nodes in the groin area. It\
\ can be a symptom of infective or neoplastic processes. Infective aetiologies\
\ include Tuberculosis, HIV, non-specific or reactive lymphadenopathy to recent\
\ lower limb infection or groin infections. Another notable infectious cause is\
\ Lymphogranuloma venereum, which is a sexually transmitted infection of the lymphatic\
\ system. Neoplastic aetiologies include lymphoma, leukaemia and metastatic disease\
\ from primary tumours in the lower limb, external genitalia or perianal region\
\ and melanoma.\n\n## References[edit]\n\n * Ferrer R (October 1998). \"Lymphadenopathy:\
\ differential diagnosis and evaluation\". Am Fam Physician. 58 (6): 1313–20.\
\ PMID 9803196.\n\n## Further reading[edit]"
- 'A number sign (#) is used with this entry because scalp-ear-nipple syndrome (SENS)
is caused by heterozygous mutation in the KCTD1 gene (613420) on chromosome 18q11.
Description
Scalp-ear-nipple syndrome is characterized by aplasia cutis congenita of the scalp,
breast anomalies that range from hypothelia or athelia to amastia, and minor anomalies
of the external ears. Less frequent clinical characteristics include nail dystrophy,
dental anomalies, cutaneous syndactyly of the digits, and renal malformations.
Penetrance appears to be high, although there is substantial variable expressivity
within families (Marneros et al., 2013).
Clinical Features'
- Familial multiple meningioma is a rare, benign neoplasm of the central nervous
system characterized by the development of multiple or, rarely, solitary meningiomas
in two or more blood relatives, without other apparent syndromic manifestations.
Depending on the localization, growth rate and size of the tumors, patients can
present with subtle, gradually worsening or abrupt and severe neurological compromise
or can be completely asymptomatic.
pipeline_tag: sentence-similarity
model-index:
- name: SentenceTransformer based on sentence-transformers/multi-qa-mpnet-base-dot-v1
results:
- task:
type: information-retrieval
name: Information Retrieval
dataset:
name: Unknown
type: unknown
metrics:
- type: cosine_accuracy@1
value: 0.18070477009024494
name: Cosine Accuracy@1
- type: cosine_accuracy@3
value: 0.5426514825956167
name: Cosine Accuracy@3
- type: cosine_accuracy@5
value: 0.7380747743876236
name: Cosine Accuracy@5
- type: cosine_accuracy@10
value: 0.8160721959604641
name: Cosine Accuracy@10
- type: cosine_precision@1
value: 0.18070477009024494
name: Cosine Precision@1
- type: cosine_precision@3
value: 0.1808838275318722
name: Cosine Precision@3
- type: cosine_precision@5
value: 0.1476149548775247
name: Cosine Precision@5
- type: cosine_precision@10
value: 0.08160721959604642
name: Cosine Precision@10
- type: cosine_recall@1
value: 0.18070477009024494
name: Cosine Recall@1
- type: cosine_recall@3
value: 0.5426514825956167
name: Cosine Recall@3
- type: cosine_recall@5
value: 0.7380747743876236
name: Cosine Recall@5
- type: cosine_recall@10
value: 0.8160721959604641
name: Cosine Recall@10
- type: cosine_ndcg@10
value: 0.49469594615283
name: Cosine Ndcg@10
- type: cosine_mrr@10
value: 0.39074511770043246
name: Cosine Mrr@10
- type: cosine_map@100
value: 0.3952600557331103
name: Cosine Map@100
- type: dot_accuracy@1
value: 0.18274602492479589
name: Dot Accuracy@1
- type: dot_accuracy@3
value: 0.5412548345509239
name: Dot Accuracy@3
- type: dot_accuracy@5
value: 0.7430167597765364
name: Dot Accuracy@5
- type: dot_accuracy@10
value: 0.8167168027503223
name: Dot Accuracy@10
- type: dot_precision@1
value: 0.18274602492479589
name: Dot Precision@1
- type: dot_precision@3
value: 0.18041827818364134
name: Dot Precision@3
- type: dot_precision@5
value: 0.1486033519553073
name: Dot Precision@5
- type: dot_precision@10
value: 0.08167168027503223
name: Dot Precision@10
- type: dot_recall@1
value: 0.18274602492479589
name: Dot Recall@1
- type: dot_recall@3
value: 0.5412548345509239
name: Dot Recall@3
- type: dot_recall@5
value: 0.7430167597765364
name: Dot Recall@5
- type: dot_recall@10
value: 0.8167168027503223
name: Dot Recall@10
- type: dot_ndcg@10
value: 0.4956715454485796
name: Dot Ndcg@10
- type: dot_mrr@10
value: 0.391808804169147
name: Dot Mrr@10
- type: dot_map@100
value: 0.39626188359327835
name: Dot Map@100
---
# SentenceTransformer based on sentence-transformers/multi-qa-mpnet-base-dot-v1
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [sentence-transformers/multi-qa-mpnet-base-dot-v1](https://huggingface.co/sentence-transformers/multi-qa-mpnet-base-dot-v1). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [sentence-transformers/multi-qa-mpnet-base-dot-v1](https://huggingface.co/sentence-transformers/multi-qa-mpnet-base-dot-v1)
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 768 tokens
- **Similarity Function:** Dot Product
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: MPNetModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("sentence_transformers_model_id")
# Run inference
sentences = [
'ear malformations, nipple abnormalities, dental anomalies',
'A number sign (#) is used with this entry because scalp-ear-nipple syndrome (SENS) is caused by heterozygous mutation in the KCTD1 gene (613420) on chromosome 18q11.\n\nDescription\n\nScalp-ear-nipple syndrome is characterized by aplasia cutis congenita of the scalp, breast anomalies that range from hypothelia or athelia to amastia, and minor anomalies of the external ears. Less frequent clinical characteristics include nail dystrophy, dental anomalies, cutaneous syndactyly of the digits, and renal malformations. Penetrance appears to be high, although there is substantial variable expressivity within families (Marneros et al., 2013).\n\nClinical Features',
'This article is an orphan, as no other articles link to it. Please introduce links to this page from related articles; try the Find link tool for suggestions. (July 2016) \n \nInguinal lymphadenopathy \nInguinal lymphadenopathy \n \nInguinal lymphadenopathy causes swollen lymph nodes in the groin area. It can be a symptom of infective or neoplastic processes. Infective aetiologies include Tuberculosis, HIV, non-specific or reactive lymphadenopathy to recent lower limb infection or groin infections. Another notable infectious cause is Lymphogranuloma venereum, which is a sexually transmitted infection of the lymphatic system. Neoplastic aetiologies include lymphoma, leukaemia and metastatic disease from primary tumours in the lower limb, external genitalia or perianal region and melanoma.\n\n## References[edit]\n\n * Ferrer R (October 1998). "Lymphadenopathy: differential diagnosis and evaluation". Am Fam Physician. 58 (6): 1313–20. PMID 9803196.\n\n## Further reading[edit]',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
## Evaluation
### Metrics
#### Information Retrieval
* Evaluated with [InformationRetrievalEvaluator
](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.InformationRetrievalEvaluator)
| Metric | Value |
|:--------------------|:-----------|
| cosine_accuracy@1 | 0.1807 |
| cosine_accuracy@3 | 0.5427 |
| cosine_accuracy@5 | 0.7381 |
| cosine_accuracy@10 | 0.8161 |
| cosine_precision@1 | 0.1807 |
| cosine_precision@3 | 0.1809 |
| cosine_precision@5 | 0.1476 |
| cosine_precision@10 | 0.0816 |
| cosine_recall@1 | 0.1807 |
| cosine_recall@3 | 0.5427 |
| cosine_recall@5 | 0.7381 |
| cosine_recall@10 | 0.8161 |
| cosine_ndcg@10 | 0.4947 |
| cosine_mrr@10 | 0.3907 |
| cosine_map@100 | 0.3953 |
| dot_accuracy@1 | 0.1827 |
| dot_accuracy@3 | 0.5413 |
| dot_accuracy@5 | 0.743 |
| dot_accuracy@10 | 0.8167 |
| dot_precision@1 | 0.1827 |
| dot_precision@3 | 0.1804 |
| dot_precision@5 | 0.1486 |
| dot_precision@10 | 0.0817 |
| dot_recall@1 | 0.1827 |
| dot_recall@3 | 0.5413 |
| dot_recall@5 | 0.743 |
| dot_recall@10 | 0.8167 |
| dot_ndcg@10 | 0.4957 |
| dot_mrr@10 | 0.3918 |
| **dot_map@100** | **0.3963** |
## Training Details
### Training Dataset
#### Unnamed Dataset
* Size: 98,928 training samples
* Columns: queries
and chunks
* Approximate statistics based on the first 1000 samples:
| | queries | chunks |
|:--------|:---------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
| type | string | string |
| details |
fever, malaise, headaches, lymphadenopathy
| A rare, acquired, self-limiting, infectious disease due to the mite-borne bacteria Rickettsia akari characterized by an asymptomatic, 0.5 to 2 cm in diameter papulovesicle that typically ulcerates and forms an eschar, followed by a generalized papulovesicular rash associating variable constitutional symptoms, such as localized lymphadenopathy, fever, malaise, and headaches. Additonal symptoms may include diaphoresis, myalgia and, less frequently, rhinorrhea, pharyngitis, nausea, vomiting, splenomegaly, conjunctival hyperemia, and abdominal pain. Systemic symtoms resolve within 6-10 days.
|
| rash, papulovesicular, generalized, constitutional symptoms
| A rare, acquired, self-limiting, infectious disease due to the mite-borne bacteria Rickettsia akari characterized by an asymptomatic, 0.5 to 2 cm in diameter papulovesicle that typically ulcerates and forms an eschar, followed by a generalized papulovesicular rash associating variable constitutional symptoms, such as localized lymphadenopathy, fever, malaise, and headaches. Additonal symptoms may include diaphoresis, myalgia and, less frequently, rhinorrhea, pharyngitis, nausea, vomiting, splenomegaly, conjunctival hyperemia, and abdominal pain. Systemic symtoms resolve within 6-10 days.
|
| myalgia, diaphoresis, nausea, vomiting
| A rare, acquired, self-limiting, infectious disease due to the mite-borne bacteria Rickettsia akari characterized by an asymptomatic, 0.5 to 2 cm in diameter papulovesicle that typically ulcerates and forms an eschar, followed by a generalized papulovesicular rash associating variable constitutional symptoms, such as localized lymphadenopathy, fever, malaise, and headaches. Additonal symptoms may include diaphoresis, myalgia and, less frequently, rhinorrhea, pharyngitis, nausea, vomiting, splenomegaly, conjunctival hyperemia, and abdominal pain. Systemic symtoms resolve within 6-10 days.
|
* Loss: [MultipleNegativesRankingLoss
](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
```json
{
"scale": 1,
"similarity_fct": "dot_score"
}
```
### Evaluation Dataset
#### Unnamed Dataset
* Size: 9,308 evaluation samples
* Columns: queries
and chunks
* Approximate statistics based on the first 1000 samples:
| | queries | chunks |
|:--------|:---------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
| type | string | string |
| details | facial features, overgrowth, learning disabilities, delayed development
| Sotos syndrome is a condition characterized mainly by distinctive facial features; overgrowth in childhood; and learning disabilities or delayed development. Facial features may include a long, narrow face; a high forehead; flushed (reddened) cheeks; a small, pointed chin; and down-slanting palpebral fissures. Affected infants and children tend to grow quickly; they are significantly taller than their siblings and peers and have a large head. Other signs and symptoms may include intellectual disability; behavioral problems; problems with speech and language; and/or weak muscle tone (hypotonia). Sotos syndrome is usually caused by a mutation in the NSD1 gene and is inherited in an autosomal dominant manner. About 95% of cases are due to a new mutation in the affected person and occur sporadically (are not inherited).
|
| long face, high forehead, flushed cheeks, small chin, down-slanting palpebral fissures
| Sotos syndrome is a condition characterized mainly by distinctive facial features; overgrowth in childhood; and learning disabilities or delayed development. Facial features may include a long, narrow face; a high forehead; flushed (reddened) cheeks; a small, pointed chin; and down-slanting palpebral fissures. Affected infants and children tend to grow quickly; they are significantly taller than their siblings and peers and have a large head. Other signs and symptoms may include intellectual disability; behavioral problems; problems with speech and language; and/or weak muscle tone (hypotonia). Sotos syndrome is usually caused by a mutation in the NSD1 gene and is inherited in an autosomal dominant manner. About 95% of cases are due to a new mutation in the affected person and occur sporadically (are not inherited).
|
| intellectual disability, behavioral problems, speech and language difficulties, hypotonia
| Sotos syndrome is a condition characterized mainly by distinctive facial features; overgrowth in childhood; and learning disabilities or delayed development. Facial features may include a long, narrow face; a high forehead; flushed (reddened) cheeks; a small, pointed chin; and down-slanting palpebral fissures. Affected infants and children tend to grow quickly; they are significantly taller than their siblings and peers and have a large head. Other signs and symptoms may include intellectual disability; behavioral problems; problems with speech and language; and/or weak muscle tone (hypotonia). Sotos syndrome is usually caused by a mutation in the NSD1 gene and is inherited in an autosomal dominant manner. About 95% of cases are due to a new mutation in the affected person and occur sporadically (are not inherited).
|
* Loss: [MultipleNegativesRankingLoss
](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
```json
{
"scale": 1,
"similarity_fct": "dot_score"
}
```
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 32
- `per_device_eval_batch_size`: 32
- `learning_rate`: 2e-05
- `num_train_epochs`: 25
- `warmup_ratio`: 0.1
- `fp16`: True
- `load_best_model_at_end`: True
- `eval_on_start`: True
- `batch_sampler`: no_duplicates
#### All Hyperparameters