wwydmanski
/

specter2_pubmed-v0.5

@@ -1,49 +1,81 @@
 ---
 base_model: allenai/specter2_base
 library_name: sentence-transformers
 pipeline_tag: sentence-similarity
 tags:
 - sentence-transformers
 - sentence-similarity
 - feature-extraction
 - generated_from_trainer
-- dataset_size:6574
 - loss:MultipleNegativesRankingLoss
 widget:
-- source_sentence: sigma N protein interactions
   sentences:
-  - 'Smoking Relapse After Lung Transplantation: Is a Second Transplant Justified? '
-  - 'Core RNA polymerase and promoter DNA interactions of purified domains of sigma
-    N: bipartite functions. '
-  - 'Protein-protein interactions mapped by artificial proteases: where sigma factors
-    bind to RNA polymerase. '
-- source_sentence: Frailty pathway co-design
   sentences:
-  - 'High-Sensitivity Cardiac Troponin I Levels in Normal and Hypertensive Pregnancy. '
-  - 'The systematic approach to improving care for Frail Older Patients (SAFE) study:
-    A protocol for co-designing a frail older person''s pathway. '
-  - 'Frailty: successful clinical practice implementation. '
-- source_sentence: Diurnal lipid metabolism in lactating sheep
   sentences:
-  - 'Interpreting and applying the EUFEST results using number needed to treat: antipsychotic
-    effectiveness in first-episode schizophrenia. '
-  - 'Diurnal variations in the concentration, arteriovenous difference, extraction
-    ratio, and uptake of 3-hydroxybutyrate and plasma free fatty acids in the hind
-    limb of lactating sheep. '
-  - 'Diurnal regulation of milk lipid production and milk secretion in the rat: effect
-    of dietary protein and energy restriction. '
-- source_sentence: Ectopic gastric mucosa
   sentences:
-  - '[Ectopic cardia and gastroesophageal reflux]. '
-  - 'A bacterial toxicity assay performed with microplates, microluminometry and Microtox
-    reagent. '
-  - 'Gastric polyp. '
-- source_sentence: monograph editing
   sentences:
-  - 'Monographs editor. '
-  - 'Maternal stress and high-fat diet effect on maternal behavior, milk composition,
-    and pup ingestive behavior. '
-  - 'The editing life. '
 ---
 # SentenceTransformer based on allenai/specter2_base
@@ -96,9 +128,9 @@ from sentence_transformers import SentenceTransformer
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
-    'monograph editing',
-    'Monographs editor. ',
-    'The editing life. ',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
@@ -134,6 +166,22 @@ You can finetune this model on your own dataset.
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
 <!--
 ## Bias, Risks and Limitations
@@ -153,19 +201,19 @@ You can finetune this model on your own dataset.
 #### json
 * Dataset: json
-* Size: 6,574 training samples
 * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
 * Approximate statistics based on the first 1000 samples:
   |         | anchor                                                                           | positive                                                                          | negative                                                                          |
   |:--------|:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
   | type    | string                                                                           | string                                                                            | string                                                                            |
-  | details | <ul><li>min: 3 tokens</li><li>mean: 7.59 tokens</li><li>max: 33 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 19.89 tokens</li><li>max: 70 tokens</li></ul> | <ul><li>min: 3 tokens</li><li>mean: 11.97 tokens</li><li>max: 50 tokens</li></ul> |
 * Samples:
-  | anchor                                           | positive                                                                                    | negative                                                                                                        |
-  |:-------------------------------------------------|:--------------------------------------------------------------------------------------------|:----------------------------------------------------------------------------------------------------------------|
-  | <code>α-Alumina Nanoparticle Grafting</code>     | <code>Grafting PMMA Brushes from α-Alumina Nanoparticles via SI-ATRP. </code>               | <code>Mesoporous alumina from colloidal biotemplating of Al clusters. </code>                                   |
-  | <code>Congenital candidiasis septic shock</code> | <code>Congenital candidiasis presenting as septic shock without rash. </code>               | <code>Congenital cutaneous candidiasis: clinical presentation, pathogenesis, and management guidelines. </code> |
-  | <code>Chronic Venous Occlusion</code>            | <code>Anatomic response of canine hindlimb vasculature to chronic venous occlusion. </code> | <code>Chronic venous insufficiency. </code>                                                                     |
 * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
   ```json
   {
@@ -177,10 +225,11 @@ You can finetune this model on your own dataset.
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
 - `per_device_train_batch_size`: 32
 - `per_device_eval_batch_size`: 32
 - `learning_rate`: 2e-05
-- `num_train_epochs`: 1
 - `lr_scheduler_type`: cosine_with_restarts
 - `warmup_ratio`: 0.1
 - `bf16`: True
@@ -191,7 +240,7 @@ You can finetune this model on your own dataset.
 - `overwrite_output_dir`: False
 - `do_predict`: False
-- `eval_strategy`: no
 - `prediction_loss_only`: True
 - `per_device_train_batch_size`: 32
 - `per_device_eval_batch_size`: 32
@@ -206,7 +255,7 @@ You can finetune this model on your own dataset.
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
 - `max_grad_norm`: 1.0
-- `num_train_epochs`: 1
 - `max_steps`: -1
 - `lr_scheduler_type`: cosine_with_restarts
 - `lr_scheduler_kwargs`: {}
@@ -305,77 +354,45 @@ You can finetune this model on your own dataset.
 </details>
 ### Training Logs
-| Epoch  | Step | Training Loss |
-|:------:|:----:|:-------------:|
-| 0.0145 | 1    | 2.8777        |
-| 0.0290 | 2    | 2.8723        |
-| 0.0435 | 3    | 2.7432        |
-| 0.0580 | 4    | 2.8806        |
-| 0.0725 | 5    | 2.3007        |
-| 0.0870 | 6    | 2.2423        |
-| 0.1014 | 7    | 1.995         |
-| 0.1159 | 8    | 1.5115        |
-| 0.1304 | 9    | 1.41          |
-| 0.1449 | 10   | 1.243         |
-| 0.1594 | 11   | 1.1634        |
-| 0.1739 | 12   | 1.1996        |
-| 0.1884 | 13   | 1.3653        |
-| 0.2029 | 14   | 1.5704        |
-| 0.2174 | 15   | 1.3556        |
-| 0.2319 | 16   | 1.4051        |
-| 0.2464 | 17   | 1.0999        |
-| 0.2609 | 18   | 1.0826        |
-| 0.2754 | 19   | 1.0449        |
-| 0.2899 | 20   | 1.0517        |
-| 0.3043 | 21   | 0.9716        |
-| 0.3188 | 22   | 1.1993        |
-| 0.3333 | 23   | 1.1375        |
-| 0.3478 | 24   | 0.9875        |
-| 0.3623 | 25   | 0.7656        |
-| 0.3768 | 26   | 1.2773        |
-| 0.3913 | 27   | 0.7802        |
-| 0.4058 | 28   | 0.882         |
-| 0.4203 | 29   | 1.0534        |
-| 0.4348 | 30   | 0.9073        |
-| 0.4493 | 31   | 0.916         |
-| 0.4638 | 32   | 0.9702        |
-| 0.4783 | 33   | 1.2868        |
-| 0.4928 | 34   | 1.0854        |
-| 0.5072 | 35   | 0.8832        |
-| 0.5217 | 36   | 0.9139        |
-| 0.5362 | 37   | 0.9032        |
-| 0.5507 | 38   | 0.965         |
-| 0.5652 | 39   | 0.7222        |
-| 0.5797 | 40   | 0.6682        |
-| 0.5942 | 41   | 0.8562        |
-| 0.6087 | 42   | 0.9248        |
-| 0.6232 | 43   | 0.9867        |
-| 0.6377 | 44   | 0.7328        |
-| 0.6522 | 45   | 0.7506        |
-| 0.6667 | 46   | 0.7952        |
-| 0.6812 | 47   | 0.7979        |
-| 0.6957 | 48   | 1.0043        |
-| 0.7101 | 49   | 1.0428        |
-| 0.7246 | 50   | 0.8772        |
-| 0.7391 | 51   | 0.6598        |
-| 0.7536 | 52   | 0.7804        |
-| 0.7681 | 53   | 0.599         |
-| 0.7826 | 54   | 0.7974        |
-| 0.7971 | 55   | 0.7489        |
-| 0.8116 | 56   | 0.8701        |
-| 0.8261 | 57   | 0.8903        |
-| 0.8406 | 58   | 0.7223        |
-| 0.8551 | 59   | 0.925         |
-| 0.8696 | 60   | 1.0247        |
-| 0.8841 | 61   | 0.7531        |
-| 0.8986 | 62   | 0.9684        |
-| 0.9130 | 63   | 0.7462        |
-| 0.9275 | 64   | 0.8555        |
-| 0.9420 | 65   | 0.8016        |
-| 0.9565 | 66   | 0.7603        |
-| 0.9710 | 67   | 1.1052        |
-| 0.9855 | 68   | 0.9505        |
-| 1.0    | 69   | 0.6259        |
 ### Framework Versions

 ---
 base_model: allenai/specter2_base
 library_name: sentence-transformers
+metrics:
+- cosine_accuracy
+- dot_accuracy
+- manhattan_accuracy
+- euclidean_accuracy
+- max_accuracy
 pipeline_tag: sentence-similarity
 tags:
 - sentence-transformers
 - sentence-similarity
 - feature-extraction
 - generated_from_trainer
+- dataset_size:10053
 - loss:MultipleNegativesRankingLoss
 widget:
+- source_sentence: HBV-endemic area diagnostic criteria comparison
   sentences:
+  - 'Comparison of usefulness of clinical diagnostic criteria for hepatocellular carcinoma
+    in a hepatitis B endemic area. '
+  - 'The validation of the 2010 American Association for the Study of Liver Diseases
+    guideline for the diagnosis of hepatocellular carcinoma in an endemic area. '
+  - 'Which admission electrocardiographic parameter is more powerful predictor of
+    no-reflow in patients with acute anterior myocardial infarction who underwent
+    primary percutaneous intervention? '
+- source_sentence: Family history of alcoholism classification schemes
   sentences:
+  - 'Developing the mentor/protege relationship. '
+  - 'Family history of alcoholism in schizophrenia. '
+  - 'Family history models of alcoholism: age of onset, consequences and dependence. '
+- source_sentence: Intellectual Property Commercialization
   sentences:
+  - 'ALEPH-2, a suspected anxiolytic and putative hallucinogenic phenylisopropylamine
+    derivative, is a 5-HT2a and 5-HT2c receptor agonist. '
+  - 'Technology transfer and monitoring practices. '
+  - '[From intellectual property to commercial property]. '
+- source_sentence: Transmembrane domain mutants
   sentences:
+  - 'Dysgerminoma; case with pulmonary metastases; result of treatment with irradiation
+    and male sex hormone. '
+  - 'Toward a high-resolution structure of phospholamban: design of soluble transmembrane
+    domain mutants. '
+  - 'Scanning N-glycosylation mutagenesis of membrane proteins. '
+- source_sentence: Six-coordinate low-spin iron(III) porphyrinate complexes
   sentences:
+  - 'Molecular structures and magnetic resonance spectroscopic investigations of highly
+    distorted six-coordinate low-spin iron(III) porphyrinate complexes. '
+  - 'Saddle-shaped six-coordinate iron(iii) porphyrin complex with unusual intermediate-spin
+    electronic structure. '
+  - 'Performing Economic Evaluation of Integrated Care: Highway to Hell or Stairway
+    to Heaven? '
+model-index:
+- name: SentenceTransformer based on allenai/specter2_base
+  results:
+  - task:
+      type: triplet
+      name: Triplet
+    dataset:
+      name: triplet dev
+      type: triplet-dev
+    metrics:
+    - type: cosine_accuracy
+      value: 0.606
+      name: Cosine Accuracy
+    - type: dot_accuracy
+      value: 0.395
+      name: Dot Accuracy
+    - type: manhattan_accuracy
+      value: 0.603
+      name: Manhattan Accuracy
+    - type: euclidean_accuracy
+      value: 0.615
+      name: Euclidean Accuracy
+    - type: max_accuracy
+      value: 0.615
+      name: Max Accuracy
 ---
 # SentenceTransformer based on allenai/specter2_base
 model = SentenceTransformer("sentence_transformers_model_id")
 # Run inference
 sentences = [
+    'Six-coordinate low-spin iron(III) porphyrinate complexes',
+    'Molecular structures and magnetic resonance spectroscopic investigations of highly distorted six-coordinate low-spin iron(III) porphyrinate complexes. ',
+    'Saddle-shaped six-coordinate iron(iii) porphyrin complex with unusual intermediate-spin electronic structure. ',
 ]
 embeddings = model.encode(sentences)
 print(embeddings.shape)
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
+## Evaluation
+### Metrics
+#### Triplet
+* Dataset: `triplet-dev`
+* Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
+| Metric              | Value     |
+|:--------------------|:----------|
+| **cosine_accuracy** | **0.606** |
+| dot_accuracy        | 0.395     |
+| manhattan_accuracy  | 0.603     |
+| euclidean_accuracy  | 0.615     |
+| max_accuracy        | 0.615     |
 <!--
 ## Bias, Risks and Limitations
 #### json
 * Dataset: json
+* Size: 10,053 training samples
 * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
 * Approximate statistics based on the first 1000 samples:
   |         | anchor                                                                           | positive                                                                          | negative                                                                          |
   |:--------|:---------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|:----------------------------------------------------------------------------------|
   | type    | string                                                                           | string                                                                            | string                                                                            |
+  | details | <ul><li>min: 4 tokens</li><li>mean: 7.49 tokens</li><li>max: 18 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 20.08 tokens</li><li>max: 48 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 12.46 tokens</li><li>max: 48 tokens</li></ul> |
 * Samples:
+  | anchor                                                       | positive                                                                                                            | negative                                                     |
+  |:-------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------|:-------------------------------------------------------------|
+  | <code>COM-induced secretome changes in U937 monocytes</code> | <code>Characterization of calcium oxalate crystal-induced changes in the secretome of U937 human monocytes. </code> | <code>Monocytes. </code>                                     |
+  | <code>Metamaterials</code>                                   | <code>Sound attenuation optimization using metaporous materials tuned on exceptional points. </code>                | <code>Metamaterials: A cat's eye for all directions. </code> |
+  | <code>Pediatric Parasitology</code>                          | <code>Parasitic infections among school age children 6 to 11-years-of-age in the Eastern province. </code>          | <code>[DIALOGUE ON PEDIATRIC PARASITOLOGY]. </code>          |
 * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
   ```json
   {
 ### Training Hyperparameters
 #### Non-Default Hyperparameters
+- `eval_strategy`: steps
 - `per_device_train_batch_size`: 32
 - `per_device_eval_batch_size`: 32
 - `learning_rate`: 2e-05
+- `num_train_epochs`: 6
 - `lr_scheduler_type`: cosine_with_restarts
 - `warmup_ratio`: 0.1
 - `bf16`: True
 - `overwrite_output_dir`: False
 - `do_predict`: False
+- `eval_strategy`: steps
 - `prediction_loss_only`: True
 - `per_device_train_batch_size`: 32
 - `per_device_eval_batch_size`: 32
 - `adam_beta2`: 0.999
 - `adam_epsilon`: 1e-08
 - `max_grad_norm`: 1.0
+- `num_train_epochs`: 6
 - `max_steps`: -1
 - `lr_scheduler_type`: cosine_with_restarts
 - `lr_scheduler_kwargs`: {}
 </details>
 ### Training Logs
+| Epoch  | Step | Training Loss | triplet-dev_cosine_accuracy |
+|:------:|:----:|:-------------:|:---------------------------:|
+| 0      | 0    | -             | 0.373                       |
+| 0.1667 | 1    | 3.138         | -                           |
+| 0.3333 | 2    | 2.9761        | -                           |
+| 0.5    | 3    | 2.7135        | -                           |
+| 0.6667 | 4    | 2.5144        | -                           |
+| 0.8333 | 5    | 1.9797        | -                           |
+| 1.0    | 6    | 1.2683        | -                           |
+| 1.1667 | 7    | 1.6058        | -                           |
+| 1.3333 | 8    | 1.3236        | -                           |
+| 1.5    | 9    | 1.1134        | -                           |
+| 1.6667 | 10   | 1.1205        | -                           |
+| 1.8333 | 11   | 0.9369        | -                           |
+| 2.0    | 12   | 0.6215        | -                           |
+| 2.1667 | 13   | 1.0374        | -                           |
+| 2.3333 | 14   | 0.9355        | -                           |
+| 2.5    | 15   | 0.7118        | -                           |
+| 2.6667 | 16   | 0.7967        | -                           |
+| 2.8333 | 17   | 0.5739        | -                           |
+| 3.0    | 18   | 0.4515        | -                           |
+| 3.1667 | 19   | 0.8018        | -                           |
+| 3.3333 | 20   | 0.6557        | -                           |
+| 3.5    | 21   | 0.6027        | -                           |
+| 3.6667 | 22   | 0.6747        | -                           |
+| 3.8333 | 23   | 0.5013        | -                           |
+| 4.0    | 24   | 0.1428        | -                           |
+| 4.1667 | 25   | 0.5889        | 0.596                       |
+| 4.3333 | 26   | 0.5439        | -                           |
+| 4.5    | 27   | 0.4742        | -                           |
+| 4.6667 | 28   | 0.5734        | -                           |
+| 4.8333 | 29   | 0.3966        | -                           |
+| 5.0    | 30   | 0.1793        | -                           |
+| 5.1667 | 31   | 0.5408        | -                           |
+| 5.3333 | 32   | 0.5174        | -                           |
+| 5.5    | 33   | 0.4179        | -                           |
+| 5.6667 | 34   | 0.4589        | -                           |
+| 5.8333 | 35   | 0.3683        | -                           |
+| 6.0    | 36   | 0.1442        | 0.606                       |
 ### Framework Versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8bde86f785555d47618677bc7c74848231a3556a1eb547e6ded8a24d9917051b
 size 439696224

 version https://git-lfs.github.com/spec/v1
+oid sha256:08d5e8be928eb50a2410dc88bc791f5b18353249539d816ed452827e06ed169a
 size 439696224