pankajrajdeo commited on
Commit
1ca7522
1 Parent(s): b01769f

Add new SentenceTransformer model.

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 384,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,697 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: pankajrajdeo/UMLS-Pubmed-ST-TCE-Epoch-1
3
+ library_name: sentence-transformers
4
+ pipeline_tag: sentence-similarity
5
+ tags:
6
+ - sentence-transformers
7
+ - sentence-similarity
8
+ - feature-extraction
9
+ - generated_from_trainer
10
+ - dataset_size:137221
11
+ - loss:MultipleNegativesRankingLoss
12
+ widget:
13
+ - source_sentence: What are the underlying physical mechanisms that allow for the
14
+ detection of rotation velocity using orbital angular momentum light spots, even
15
+ when they are completely deviated from the rotation center?
16
+ sentences:
17
+ - 'Pneumothorax following ultrasound-guided jugular vein puncture for central venous
18
+ access in interventional radiology: 4 years of experience. PURPOSE: The purpose
19
+ of our study was to review the rate of pneumothorax following central venous access,
20
+ using real-time ultrasound guidance. MATERIALS AND METHODS: Data related to ultrasound-guided
21
+ venous puncture, for central venous access, performed between July 1, 2004 and
22
+ June 30, 2008 was retrospectively and prospectively collected. Access route, needle
23
+ gauge, catheter type, and diagnosis of pneumothorax on the intraprocedure spot
24
+ radiographs and or the postprocedure chest radiographs, were recorded. RESULTS:
25
+ A total of 1262 ultrasound-guided jugular venous puncture for central venous access
26
+ were performed on a total of 1066 patients between July 1, 2004 and June 30, 2008.
27
+ Access vessels included 983 right internal jugular veins, 275 left internal jugular
28
+ veins, and 4 right external jugular veins. No pneumothorax (0%) was identified.
29
+ CONCLUSION: Due to an extremely low rate of pneumothorax following ultrasound-guided
30
+ central venous access, 0% in our study and other published studies, we suggest
31
+ that routine postprocedure chest radiograph to exclude pneumothorax may be dispensed
32
+ unless it is suspected by the operator or if the patient becomes symptomatic.'
33
+ - 'Targeting CDK4/6 in patients with cancer. The cyclin D-cyclin dependent kinase
34
+ (CDK) 4/6-inhibitor of CDK4 (INK4)-retinoblastoma (Rb) pathway controls cell cycle
35
+ progression by regulating the G1-S checkpoint. Dysregulation of the cyclin D-CDK4/6-INK4-Rb
36
+ pathway results in increased proliferation, and is frequently observed in many
37
+ types of cancer. Pathway activation can occur through a variety of mechanisms,
38
+ including gene amplification or rearrangement, loss of negative regulators, epigenetic
39
+ alterations, and point mutations in key pathway components. Due to the importance
40
+ of CDK4/6 activity in cancer cells, CDK4/6 inhibitors have emerged as promising
41
+ candidates for cancer treatment. Moreover, combination of a CDK4/6 inhibitor with
42
+ other targeted therapies may help overcome acquired or de novo treatment resistance.
43
+ Ongoing studies include combinations of CDK4/6 inhibitors with endocrine therapy
44
+ and phosphatidylinositol 3-kinase (PI3K) pathway inhibitors for hormone receptor-positive
45
+ (HR+) breast cancers, and with selective RAF and MEK inhibitors for tumors with
46
+ alterations in the mitogen activated protein kinase (MAPK) pathway such as melanoma.
47
+ In particular, the combination of CDK4/6 inhibitors with endocrine therapy, such
48
+ as palbociclib''s recent first-line approval in combination with letrozole, is
49
+ expected to transform the treatment of HR+ breast cancer. Currently, three selective
50
+ CDK4/6 inhibitors have been approved or are in late-stage development: palbociclib
51
+ (PD-0332991), ribociclib (LEE011), and abemaciclib (LY2835219). Here we describe
52
+ the current preclinical and clinical data for these novel agents and discuss combination
53
+ strategies with other agents for the treatment of cancer.'
54
+ - Rotation velocity detection with orbital angular momentum light spot completely
55
+ deviated out of the rotation center. Based on the rotational Doppler effect, an
56
+ orbital angular momentum beam can measure the lateral rotation velocity of an
57
+ object, which has broad application prospects. However, all existing research
58
+ focus on the light spot center coinciding with the rotation center, or only with
59
+ small center offset. This is difficult to ensure in remote detection applications.
60
+ In this paper, the rotational Doppler frequency shifts under three cases, including
61
+ no center offset, small center offset and large center offset, are analyzed theoretically.
62
+ Through theoretical research results, a novel method of measuring rotation velocity
63
+ is proposed, with the light spot completely deviated out of the rotation center.
64
+ A laboratory verification experiment shows that this proposed method breaks the
65
+ limit of center offset of lateral rotation velocity measurement and is of great
66
+ significance to the remote detection of non-cooperative rotation object.
67
+ - source_sentence: What are the implications of hydrogen bonding patterns on the supramolecular
68
+ assembly of molecules, and how can these interactions be manipulated or controlled?
69
+ sentences:
70
+ - M.V. Volkenstein, evolutionary thinking and the structure of fitness landscapes.
71
+ High dimensional fitness landscapes are robustly dominated by saddle points, not
72
+ isolated peaks. We present an argument to this effect that is reminiscent of May's
73
+ complexity stability analysis and trace out the significance for the dynamics
74
+ of speciation, the connection between the neutral and punctuated aspects of evolution
75
+ and evolution on moving landscapes. The paper is written in honor of M.V. Volkenstein,
76
+ who devoted his last papers to uniting dynamics with evolutionary thinking.
77
+ - Differential impacts of smoke-free laws on indoor air quality. The authors assessed
78
+ the impacts of two different smoke-free laws on indoor air quality. They compared
79
+ the indoor air quality of 10 hospitality venues in Lexington and Louisville, Kentucky,
80
+ before and after the smoke-free laws went into effect. Real-time measurements
81
+ of particulate matter with aerodynamic diameter of 2.5 microm or smaller (PM2.5)
82
+ were made. One Lexington establishment was excluded from the analysis of results
83
+ because of apparent smoking violation after the law went into effect. The average
84
+ indoor PM2.5 concentrations in the nine Lexington venues decreased 91 percent,
85
+ from 199 to 18 microg/m3. The average indoor PM2.5 concentrations in the 10 Louisville
86
+ venues, however, increased slightly, from 304 to 338 microg/m3. PM2.5 levels in
87
+ the establishments decreased as numbers of burning cigarettes decreased. While
88
+ the Louisville partial smoke-free law with exemptions did not reduce indoor air
89
+ pollution in the selected venues, comprehensive and properly enforced smoke-free
90
+ laws can be an effective means of reducing indoor air pollution.
91
+ - 'Structures of three substituted arenesulfonamides from X-ray powder diffraction
92
+ data using the differential evolution technique. The structures of three substituted
93
+ arenesulfonamides have been solved from laboratory X-ray powder diffraction data,
94
+ using a new direct-space structure solution method based on a differential evolution
95
+ algorithm, and refined by the Rietveld method. In 2-toluenesulfonamide, C(7)H(9)NO(2)S
96
+ (I) (tetragonal I4(1)/a, Z = 16), the molecules are linked by N-H...O=S hydrogen
97
+ bonds into a three-dimensional framework. In 3-nitrobenzenesulfonamide, C(6)H(6)N(2)O(4)S
98
+ (II) (monoclinic P2(1), Z = 2), N-H...O=S hydrogen bonds produce molecular ladders,
99
+ which are linked into sheets by C-H...O=S hydrogen bonds: the nitro group does
100
+ not participate in the hydrogen bonding. Molecules of 4-nitrobenzenesulfonamide,
101
+ C(6)H(6)N(2)O(4)S (III) (monoclinic P2(1)/n, Z = 4), are linked into sheets by
102
+ four types of hydrogen bond, N-H...O=S, N-H...O(nitro), C-H...O=S and C-H...O(nitro),
103
+ and the sheets are weakly linked by aromatic pi...pi stacking interactions.'
104
+ - source_sentence: When ceritinib used instead of crizotinib?
105
+ sentences:
106
+ - 'Pemedolac: a novel and long-acting non-narcotic analgesic. Pemedolacindole-1-acetic
107
+ acid; AY-30,715] exhibited potent analgesic effects against chemically induced
108
+ pain in rats and mice and against inflammatory pain in rats. In each of the animal
109
+ models used the analgesic potency of pemedolac was defined by an ED50 of 2.0 mg/kg
110
+ p.o. or less. Significant analgesic activity was detected in rats at 16 hr after
111
+ administration of 1 mg/kg p.o. (paw pressure test) and at 10 hr after administration
112
+ of 10 mg/kg p.o. to mice (p-phenylbenzoquinone writhing). Inasmuch as pemedolac
113
+ was inactive in the hot plate and tail-flick tests; and its analgesic activity
114
+ was not antagonized by naloxone (1 mg/kg s.c.), and tolerance did not develop
115
+ upon multiple administration; this drug does not exert its analgesic effects through
116
+ an opiate mechanism. Pemedolac differed from standard nonsteroidal anti-inflammatory
117
+ drugs (NSAIDs) in that the doses which produced analgesia were much lower than
118
+ those required for either anti-inflammatory or gastric irritant effects. In acute
119
+ anti-inflammatory tests, pemedolac exhibited only weak activity as evidenced by
120
+ an ED50 approximately 100 mg/kg p.o. in the carrageenan paw edema procedure. This
121
+ demonstrates for pemedolac a separation of at least 50-fold between the acute
122
+ analgesic and anti-inflammatory activities, which was greater than that observed
123
+ with reference NSAIDs. The compound also had a low ulcerogenic liability with
124
+ an acute UD50 = 107 mg/kg p.o. and a subacute UD50 estimated to be 140 mg/kg/day
125
+ p.o. In contrast, the reference NSAIDS (piroxicam, indomethacin, naproxen and
126
+ ibuprofen) exhibited similar dose-response relationships for the analgesic, anti-inflammatory
127
+ and gastric irritant effects.(ABSTRACT TRUNCATED AT 250 WORDS).'
128
+ - Genome-wide detection of CNVs associated with beak deformity in chickens using
129
+ high-density 600K SNP arrays. Beak deformity (crossed beaks) is found in several
130
+ indigenous chicken breeds including Beijing-You studied here. Birds with deformed
131
+ beaks have reduced feed intake and poor production performance. Recently, copy
132
+ number variation (CNV) has been examined in many species and is recognized as
133
+ a source of genetic variation, especially for disease phenotypes. In this study,
134
+ to unravel the genetic mechanisms underlying beak deformity, we performed genome-wide
135
+ CNV detection using Affymetrix chicken high-density 600K data on 48 deformed-beak
136
+ and 48 normal birds using penncnv. As a result, two and eight CNV regions (CNVRs)
137
+ covering 0.32 and 2.45 Mb respectively on autosomes were identified in deformed-beak
138
+ and normal birds respectively. Further RT-qPCR studies validated nine of the 10
139
+ CNVRs. The ratios of six CNVRs were significantly different between deformed-beak
140
+ and normal birds (P < 0.01). Within these six regions, three and 21 known genes
141
+ were identified in deformed-beak and normal birds respectively. Bioinformatics
142
+ analysis showed that these genes were enriched in six GO terms and one KEGG pathway.
143
+ Five candidate genes in the CNVRs were further validated using RT-qPCR. The expression
144
+ of LRIG2 (leucine rich repeats and immunoglobulin like domains 2) was lower in
145
+ birds with deformed beaks (P < 0.01). Therefore, the LRIG2 gene could be considered
146
+ a key factor in view of its known functions and its potential roles in beak deformity.
147
+ Overall, our results will be helpful for future investigations of the genomic
148
+ structural variations underlying beak deformity in chickens.
149
+ - 'Ceritinib: a new tyrosine kinase inhibitor for non-small-cell lung cancer. OBJECTIVE:
150
+ To review ceritinib for the treatment of anaplastic lymphoma kinase (ALK)-positive
151
+ metastatic non-small-cell lung cancer (NSCLC). DATA SOURCES: Literature searches
152
+ were conducted in PubMed, EMBASE (1974 to July week 5, 2014), and Google Scholar
153
+ using the terms ceritinib, LDK378, and non-small-cell lung cancer. STUDY SELECTION
154
+ AND DATA EXTRACTION: One phase 1 trial and 2 abstracts were identified. DATA SYNTHESIS:
155
+ Ceritinib is approved for the treatment of ALK-positive metastatic NSCLC in patients
156
+ who are intolerant to or have progressed despite therapy with crizotinib. In the
157
+ phase 1 clinical trial, the maximum tolerated dose was determined to be 750 mg
158
+ once daily. The overall response rate (ORR) was 58% (95% CI = 48-67) in patients
159
+ who received ≥400 mg daily (n = 114). In this group, the ORR was 56% (95% CI =
160
+ 41-67) and 62% (95% CI = 44-78) among crizotinib-exposed and -naïve patients,
161
+ respectively. The ORR was 59% (95% CI = 47-70) in patients who received 750 mg
162
+ daily (n = 78). The ORR was 56% (95% CI = 41-70) in crizotinib-treated patients
163
+ and 64% (95% CI = 44-81) in crizotinib-naïve patients, respectively, in this subset.
164
+ The median duration of response was 8.2 months. Median progression-free survival
165
+ was 7.0 months. The most common adverse reactions included diarrhea, nausea, vomiting,
166
+ abdominal pain, anorexia, constipation, fatigue, and elevated transaminases. CONCLUSIONS:
167
+ Ceritinib has activity in crizotinib-resistant and crizotinib-naïve patients and
168
+ appears to be a viable alternative for ALK-positive NSCLC. Long-term data are
169
+ needed to further define the role of ceritinib in the treatment of NSCLC.'
170
+ - source_sentence: How do underlying physiological mechanisms influence the relationship
171
+ between muscle tension and headache disorders?
172
+ sentences:
173
+ - 'The correlation of lncRNA SNHG16 with inflammatory cytokines, adhesion molecules,
174
+ disease severity, and prognosis in acute ischemic stroke patients. BACKGROUND:
175
+ Long non-coding RNA small nucleolar RNA host gene 16 (lncRNA SNHG16) is involved
176
+ in the pathogenesis of acute ischemic stroke (AIS) through the regulation of brain
177
+ endothelial cell viability, inflammation, atherosclerotic plaque formation, and
178
+ neural apoptosis. This study aimed to evaluate the prognostic value of lncRNA
179
+ SNHG16 in AIS patients. METHODS: Newly diagnosed AIS patients (N = 120) were serially
180
+ recruited. Their lncRNA SNHG16 expressions in peripheral blood mononuclear cells
181
+ (PBMCs) were detected by reverse transcription-quantitative polymerase chain reaction
182
+ (RT-qPCR); serum inflammatory cytokines and adhesion molecules were determined
183
+ using enzyme-linked immunosorbent assay (ELISA). The accumulating recurrence-free
184
+ survival (RFS) and overall survival (OS) were analyzed. Moreover, controls (N
185
+ = 60) were recruited and their lncRNA SNHG16 expressions in PBMCs were detected.
186
+ RESULTS: LncRNA SNHG16 was declined in AIS patients compared to controls (p <
187
+ 0.001). Moreover, lncRNA SNHG16 was not related to any comorbidities in AIS patients
188
+ (all p > 0.05). Interestingly, lncRNA SNHG16 was negatively related to tumor necrosis
189
+ factor alpha (TNF-α) (p < 0.001), interleukin 6 (IL-6) (p = 0.013), and intracellular
190
+ cell adhesion molecule-1 (ICAM-1) (p = 0.024), while positively correlated with
191
+ interleukin 10 (IL-10) (p = 0.022) in AIS patients. Besides, lncRNA SNHG16 was
192
+ inversely associated with the National Institutes of Health Stroke Scale (NIHSS)
193
+ score in AIS patients (p = 0.003). During the follow-up period, in 14 (11.7%)
194
+ patients occurred recurrence and 5 (4.2%) patients died. Unexpectedly, lncRNA
195
+ SNHG16 was not associated with accumulating RFS (p = 0.103) or OS (p = 0.150)
196
+ in AIS patients. CONCLUSION: LncRNA SNHG16 relates to lower inflammatory cytokines,
197
+ adhesion molecules, and milder disease severity, but fails to predict prognosis
198
+ in AIS patients.'
199
+ - 'Burning mouth syndrome: a discussion of a complex pathology. Burning mouth syndrome
200
+ is a complex pathology for which there is very little information about the etiology
201
+ and pathogenesis. This lack of knowledge leaves patients with suboptimal treatments.
202
+ This article discusses the existing scientific evidence about this disease. Since
203
+ topical oral use of clonazepam have been shown to be effective and safe to treat
204
+ some patients suffering with burning mouth syndrome, formulations including clonazepam
205
+ are included with this article. Compounding topical preparations of clonazepam
206
+ offers opportunities for compounding pharmacists to be more involved in improving
207
+ the quality of life of burning mouth syndrome patients.'
208
+ - 'Tension headaches and muscle tension: is there a role for magnesium? Although
209
+ many theories and hypotheses have been offered for the etiology of tension-type
210
+ headache (TH), no one previous hypothesis seems to adequately explain TH. This
211
+ may, in large measure, account for why it is often difficult to effectively treat
212
+ TH. Herein, we review current and old hypotheses of TH and offer a new hypothesis
213
+ which is consistent with what is known about TH. We show that magnesium (Mg) metabolism
214
+ may be pivotal in both the etiology and treatment of TH. Measurement of serum
215
+ ionized Mg2+ (IMg2+) levels and brain intracellular free Mg2+ (i) appear to offer
216
+ excellent methods for establishing the validity of our hypothesis. Since approximately
217
+ 70% of patients who have a TH exhibit muscular tightness and tenderness, it is
218
+ distinctly possible that problems in Mg metabolism and dietary intake are the
219
+ links to concomitant muscle tension and TH. The significance of release of pain
220
+ mediators, muscle cramps, muscle strains (and damage) and muscle tension to TH,
221
+ and its relationship to Mg metabolism, are reviewed. These are all associated
222
+ with a Mg-deficient state. It seems clear from the available data that TH''s are
223
+ more associated with muscle tension or scalp tension than any other headache type.
224
+ From the data available, Mg supplementation appears to be of great benefit in
225
+ many of these situations. We believe there is a great need for clinicians to examine
226
+ Mg2+ metabolism, bioavailable Mg2+ in muscle tissues and blood, and the effectiveness
227
+ of Mg salts (in a double-blinded, placebo-controlled manner) in subjects with
228
+ TH and muscle tension.'
229
+ - source_sentence: How do the structural properties of nanoporous materials influence
230
+ their efficiency in catalytic reactions?
231
+ sentences:
232
+ - 'Rationale and design of the Kanyini guidelines adherence with the polypill (Kanyini-GAP)
233
+ study: a randomised controlled trial of a polypill-based strategy amongst indigenous
234
+ and non indigenous people at high cardiovascular risk. BACKGROUND: The Kanyini
235
+ Guidelines Adherence with the Polypill (Kanyini-GAP) Study aims to examine whether
236
+ a polypill-based strategy (using a single capsule containing aspirin, a statin
237
+ and two blood pressure-lowering agents) amongst Indigenous and non-Indigenous
238
+ people at high risk of experiencing a cardiovascular event will improve adherence
239
+ to guideline-indicated therapies, and lower blood pressure and cholesterol levels.
240
+ METHODS/DESIGN: The study is an open, randomised, controlled, multi-centre trial
241
+ involving 1000 participants at high risk of cardiovascular events recruited from
242
+ mainstream general practices and Aboriginal Medical Services, followed for an
243
+ average of 18 months. The participants will be randomised to one of two versions
244
+ of the polypill, the version chosen by the treating health professional according
245
+ to clinical features of the patient, or to usual care. The primary study outcomes
246
+ will be changes, from baseline measures, in serum cholesterol and systolic blood
247
+ pressure and self-reported current use of aspirin, a statin and at least two blood
248
+ pressure lowering agents. Secondary study outcomes include cardiovascular events,
249
+ renal outcomes, self-reported barriers to indicated therapy, prescription of indicated
250
+ therapy, occurrence of serious adverse events and changes in quality-of-life.
251
+ The trial will be supplemented by formal economic and process evaluations. DISCUSSION:
252
+ The Kanyini-GAP trial will provide new evidence as to whether or not a polypill-based
253
+ strategy improves adherence to effective cardiovascular medications amongst individuals
254
+ in whom these treatments are indicated. TRIAL REGISTRATION: This trial is registered
255
+ with the Australian New Zealand Clinical Trial Registry ACTRN126080005833347.'
256
+ - A highly robust cluster-based indium(III)-organic framework with efficient catalytic
257
+ activity in cycloaddition of CO2 and Knoevenagel condensation. The efficient catalytic
258
+ performance displayed by MOFs is decided by an appropriate charge/radius ratio
259
+ of defect metal sites, large enough solvent-accessible channels and Lewis base
260
+ sites capable of polarizing substrate molecules. Herein, the solvothermal self-assembly
261
+ led to a highly robust nanochannel-based framework of {·2DMF·5H2O}n (NUC-66) with
262
+ a 56.8% void volume, which is a combination of a tetranuclear cluster (abbreviated
263
+ as {In4}) and a conjugated tetracyclic pentacarboxylic acid ligand of 4,4'-(4-(4-carboxyphenyl)pyridine-2,6-diyl)diisophthalic
264
+ acid (H5CPDD). To the best of our knowledge, NUC-66 is a rarely reported {In4}-based
265
+ 3D framework with embedded hierarchical triangular-microporous (2.9 Å) and hexagonal-nanoporous
266
+ (12.0 Å) channels, which are shaped by six rows of {In4} clusters. After solvent
267
+ exchange and vacuum drying, the surface of nanochannels in desolvated NUC-66a
268
+ is modified by unsaturated In3+ ions, Npyridine atoms and μ3-OH groups, all of
269
+ which display polarization ability towards polar molecules due to their Lewis
270
+ acidity or basicity. The catalytic experiments performed showed that NUC-66a had
271
+ high catalytic activity in the cycloaddition reactions of epoxides with CO2 under
272
+ mild conditions, which should be ascribed to its structural advantages including
273
+ nanoscale channels, rich bifunctional active sites, large surface areas and chemical
274
+ stability. Moreover, NUC-66a, as a heterogeneous catalyst, could greatly accelerate
275
+ the Knoevenagel condensation reactions of aldehydes and malononitrile. Hence,
276
+ this work confirms that the development of rigid nanoporous cluster-based MOFs
277
+ built on metal ions with a high charge and large radius ratio will be more likely
278
+ to realize practical applications, such as catalysis, adsorption and separation
279
+ of gas, etc.
280
+ - Absolute quantification of dehydroacetic acid in processed foods using quantitative
281
+ 1H NMR. An absolute quantification method for the determination of dehydroacetic
282
+ acid in processed foods using quantitative (1)H NMR was developed and validated.
283
+ The level of dehydroacetic acid was determined using the proton signals of dehydroacetic
284
+ acid referenced to 1,4-bis (trimethylsilyl) benzene-d4 after simple solvent extraction
285
+ from processed foods. All the recoveries from three processed foods spiked at
286
+ two different concentrations were larger than 85%. The proposed method also proved
287
+ to be precise, with inter-day precision and excellent linearity. The limit of
288
+ quantification was confirmed as 0.13g/kg in processed foods, which is sufficiently
289
+ low for the purposes of monitoring dehydroacetic acid. Furthermore, the method
290
+ is rapid and easy to apply, and provides International System of Units traceability
291
+ without the need for authentic analyte reference materials. Therefore, the proposed
292
+ method is a useful and practical tool for determining the level of dehydroacetic
293
+ acid in processed foods.
294
+ ---
295
+
296
+ # SentenceTransformer based on pankajrajdeo/UMLS-Pubmed-ST-TCE-Epoch-1
297
+
298
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [pankajrajdeo/UMLS-Pubmed-ST-TCE-Epoch-1](https://huggingface.co/pankajrajdeo/UMLS-Pubmed-ST-TCE-Epoch-1). It maps sentences & paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
299
+
300
+ ## Model Details
301
+
302
+ ### Model Description
303
+ - **Model Type:** Sentence Transformer
304
+ - **Base model:** [pankajrajdeo/UMLS-Pubmed-ST-TCE-Epoch-1](https://huggingface.co/pankajrajdeo/UMLS-Pubmed-ST-TCE-Epoch-1) <!-- at revision 0c787217b5a51d93286052fb773dea46ff9b1e57 -->
305
+ - **Maximum Sequence Length:** 1024 tokens
306
+ - **Output Dimensionality:** 384 tokens
307
+ - **Similarity Function:** Cosine Similarity
308
+ <!-- - **Training Dataset:** Unknown -->
309
+ <!-- - **Language:** Unknown -->
310
+ <!-- - **License:** Unknown -->
311
+
312
+ ### Model Sources
313
+
314
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
315
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
316
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
317
+
318
+ ### Full Model Architecture
319
+
320
+ ```
321
+ SentenceTransformer(
322
+ (0): Transformer({'max_seq_length': 1024, 'do_lower_case': False}) with Transformer model: BertModel
323
+ (1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
324
+ )
325
+ ```
326
+
327
+ ## Usage
328
+
329
+ ### Direct Usage (Sentence Transformers)
330
+
331
+ First install the Sentence Transformers library:
332
+
333
+ ```bash
334
+ pip install -U sentence-transformers
335
+ ```
336
+
337
+ Then you can load this model and run inference.
338
+ ```python
339
+ from sentence_transformers import SentenceTransformer
340
+
341
+ # Download from the 🤗 Hub
342
+ model = SentenceTransformer("pankajrajdeo/UMLS-Pubmed-ST-TCE-Epoch-1-QA_100K-BioASQ-Epoch_5")
343
+ # Run inference
344
+ sentences = [
345
+ 'How do the structural properties of nanoporous materials influence their efficiency in catalytic reactions?',
346
+ "A highly robust cluster-based indium(III)-organic framework with efficient catalytic activity in cycloaddition of CO2 and Knoevenagel condensation. The efficient catalytic performance displayed by MOFs is decided by an appropriate charge/radius ratio of defect metal sites, large enough solvent-accessible channels and Lewis base sites capable of polarizing substrate molecules. Herein, the solvothermal self-assembly led to a highly robust nanochannel-based framework of {·2DMF·5H2O}n (NUC-66) with a 56.8% void volume, which is a combination of a tetranuclear cluster (abbreviated as {In4}) and a conjugated tetracyclic pentacarboxylic acid ligand of 4,4'-(4-(4-carboxyphenyl)pyridine-2,6-diyl)diisophthalic acid (H5CPDD). To the best of our knowledge, NUC-66 is a rarely reported {In4}-based 3D framework with embedded hierarchical triangular-microporous (2.9 Å) and hexagonal-nanoporous (12.0 Å) channels, which are shaped by six rows of {In4} clusters. After solvent exchange and vacuum drying, the surface of nanochannels in desolvated NUC-66a is modified by unsaturated In3+ ions, Npyridine atoms and μ3-OH groups, all of which display polarization ability towards polar molecules due to their Lewis acidity or basicity. The catalytic experiments performed showed that NUC-66a had high catalytic activity in the cycloaddition reactions of epoxides with CO2 under mild conditions, which should be ascribed to its structural advantages including nanoscale channels, rich bifunctional active sites, large surface areas and chemical stability. Moreover, NUC-66a, as a heterogeneous catalyst, could greatly accelerate the Knoevenagel condensation reactions of aldehydes and malononitrile. Hence, this work confirms that the development of rigid nanoporous cluster-based MOFs built on metal ions with a high charge and large radius ratio will be more likely to realize practical applications, such as catalysis, adsorption and separation of gas, etc.",
347
+ 'Absolute quantification of dehydroacetic acid in processed foods using quantitative 1H NMR. An absolute quantification method for the determination of dehydroacetic acid in processed foods using quantitative (1)H NMR was developed and validated. The level of dehydroacetic acid was determined using the proton signals of dehydroacetic acid referenced to 1,4-bis (trimethylsilyl) benzene-d4 after simple solvent extraction from processed foods. All the recoveries from three processed foods spiked at two different concentrations were larger than 85%. The proposed method also proved to be precise, with inter-day precision and excellent linearity. The limit of quantification was confirmed as 0.13g/kg in processed foods, which is sufficiently low for the purposes of monitoring dehydroacetic acid. Furthermore, the method is rapid and easy to apply, and provides International System of Units traceability without the need for authentic analyte reference materials. Therefore, the proposed method is a useful and practical tool for determining the level of dehydroacetic acid in processed foods.',
348
+ ]
349
+ embeddings = model.encode(sentences)
350
+ print(embeddings.shape)
351
+ # [3, 384]
352
+
353
+ # Get the similarity scores for the embeddings
354
+ similarities = model.similarity(embeddings, embeddings)
355
+ print(similarities.shape)
356
+ # [3, 3]
357
+ ```
358
+
359
+ <!--
360
+ ### Direct Usage (Transformers)
361
+
362
+ <details><summary>Click to see the direct usage in Transformers</summary>
363
+
364
+ </details>
365
+ -->
366
+
367
+ <!--
368
+ ### Downstream Usage (Sentence Transformers)
369
+
370
+ You can finetune this model on your own dataset.
371
+
372
+ <details><summary>Click to expand</summary>
373
+
374
+ </details>
375
+ -->
376
+
377
+ <!--
378
+ ### Out-of-Scope Use
379
+
380
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
381
+ -->
382
+
383
+ <!--
384
+ ## Bias, Risks and Limitations
385
+
386
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
387
+ -->
388
+
389
+ <!--
390
+ ### Recommendations
391
+
392
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
393
+ -->
394
+
395
+ ## Training Details
396
+
397
+ ### Training Dataset
398
+
399
+ #### Unnamed Dataset
400
+
401
+
402
+ * Size: 137,221 training samples
403
+ * Columns: <code>anchor</code> and <code>positive</code>
404
+ * Approximate statistics based on the first 1000 samples:
405
+ | | anchor | positive |
406
+ |:--------|:----------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
407
+ | type | string | string |
408
+ | details | <ul><li>min: 6 tokens</li><li>mean: 23.82 tokens</li><li>max: 47 tokens</li></ul> | <ul><li>min: 27 tokens</li><li>mean: 281.43 tokens</li><li>max: 915 tokens</li></ul> |
409
+ * Samples:
410
+ | anchor | positive |
411
+ |:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
412
+ | <code>How do menstrual-related factors, such as pain and cycle irregularity, impact the mental health and well-being of young women in educational settings?</code> | <code>Determinants of premenstrual dysphoric disorder and associated factors among regular undergraduate students at Hawassa University Southern, Ethiopia, 2023: institution-based cross-sectional study. BACKGROUND: Premenstrual dysphoric disorder (PMDD) is a condition causing severe emotional, physical, and behavioral symptoms before menstruation. It greatly hinders daily activities, affecting academic and interpersonal relationships. Attention is not given to premenstrual disorders among female students in higher education. As a result, students are susceptible to stress, and their academic success is influenced by various factors, including their menstrual cycle, and the long-term outcomes and consequences are poorly researched. Even though PMDD has a significant negative impact on student's academic achievement and success limited research has been conducted in low- and middle-income countries including Ethiopia, especially in the study setting. Therefore, a study is needed to assess premenstrual dysphoric disorder and associated factors among regular undergraduate students at Hawassa University. METHODS: An institutional-based cross-sectional study was conducted among 374 regular undergraduate female students at Hawassa University, College of Medicine and Health Sciences. A self-administered structured premenstrual symptoms screening tool for adolescents was used to assess premenstrual dysphoric disorder. The collected data were loaded into a statistical package for the social science version 25 and analyzed using it. Both bivariate and multivariate logistic regression were used to identify factors associated with premenstrual dysphoric disorder. Each independent variable was entered separately into bivariate analysis, and a variable with a p-value less than 0.25 were included in the multivariate analysis to adjust the possible confounders. Statistically significant was declared at a 95% confidence interval when variable with a p-value less than 0.05 in the multivariate analysis with premenstrual dysphoric disorder. RESULTS: The magnitude of premenstrual dysphoric disorder in this study was 62.6% (95% CI 57.4-67.5). Having severe premenstrual pain (AOR = 6.44;95%CI 1.02-40.73), having irregular menstrual cycle (AOR = 2.21; 95% CI 1.32-3.70), students who had poor social support (AOR = 5.10;95%CI, (2.76-12.92) and moderate social support (AOR = 4.93;95%CI (2.18-11.18), and students who used contraception (AOR = 3.76;95%CI, 2.21-6,40) were statistically significant factors with the outcome variable. CONCLUSION: The prevalence of premenstrual dysphoric disorder was high as compared to other studies. There was a strong link between irregular menstrual cycle, severe menstrual pain (severe dysmenorrhea), poor social support, and contraception use with premenstrual dysphoric disorder. This needs early screening and intervention to prevent the complications and worsening of the symptoms that affect students' academic performance by the institution.</code> |
413
+ | <code>How do sleep patterns influence cognitive function and learning in humans, and what are the broader implications for understanding neurological disorders?</code> | <code>Neurochemical mechanisms for memory processing during sleep: basic findings in humans and neuropsychiatric implications. Sleep is essential for memory formation. Active systems consolidation maintains that memory traces that are initially stored in a transient store such as the hippocampus are gradually redistributed towards more permanent storage sites such as the cortex during sleep replay. The complementary synaptic homeostasis theory posits that weak memory traces are erased during sleep through a competitive down-selection mechanism, ensuring the brain's capability to learn new information. We discuss evidence from neuropharmacological experiments in humans to show how major neurotransmitters and neuromodulators are implicated in these memory processes. As to the major excitatory neurotransmitter glutamate that plays a prominent role in inducing synaptic consolidation, we show that these processes, while strengthening cortical memory traces during sleep, are insufficient to explain the consolidation of hippocampus-dependent declarative memories. In the inhibitory GABAergic system, we will offer insights how drugs may alter the intricate interplay of sleep oscillations that have been identified to be crucial for strengthening memories during sleep. Regarding the dopaminergic reward system, we will show how it is engaged during sleep replay, but that dopaminergic neuromodulation likely plays a side role for enhancing relevant memories during sleep. Also, we briefly go into basic evidence on acetylcholine and cortisol whose low tone during slow wave sleep (SWS) is crucial in supporting hippocampal-to-neocortical memory transmission. Finally, we will outline how these insights can be used to improve treatment of neuropsychiatric disorders focusing mainly on anxiety disorders, depression, and addiction that are strongly related to memory processing.</code> |
414
+ | <code>What are the underlying physiological mechanisms by which elevated brain natriuretic peptide levels interact with heart rate variability to increase the likelihood of cardiovascular events?</code> | <code>The Combination of Non-dipper Heart Rate and High Brain Natriuretic Peptide Predicts Cardiovascular Events: The Japan Morning Surge-Home Blood Pressure (J-HOP) Study. BACKGROUND: We hypothesized that the association between the dipping heart rate (HR) pattern and cardiovascular (CV) events differs according to the brain natriuretic peptide (BNP) level. METHODS: We examined a subgroup of 1,369 patients from the Japan Morning Surge Home Blood Pressure study; these were patients who had CV risk factors and had undergone ambulatory blood pressure (BP) monitoring. HR non-dipping status was defined as (awake HR - sleep HR)/awake HR <0.1, and high BNP was defined as ≥35 pg/ml. We divided the patients into four groups according to their HR dipper status (dipping or non-dipping) and BNP level (normal or high). RESULTS: The mean follow-up period was 60 ± 30 months. The primary endpoints were fatal/nonfatal CV events (myocardial infarction, angina pectoris, stroke, hospitalization for heart failure, and aortic dissection). During the follow-up period, 23 patients (2.8%) in the dipper HR with normal BNP group, 8 patients (4.4%) in the non-dipper HR with normal BNP group, 24 patients (9.5%) in the dipper HR with high-BNP group, and 25 patients (21.0%) in the non-dipper HR with high-BNP group suffered primary endpoints (log rank 78.8, P < 0.001). Non-dipper HR was revealed as an independent predictor of CV events (hazard ratio, 2.13; 95% confidence interval, 1.35-3.36; P = 0.001) after adjusting for age, gender and smoking, dyslipidemia, diabetes mellitus, chronic kidney disease, BNP, non-dipper BP, 24-h HR, and 24-h systolic blood pressure. CONCLUSIONS: The combination of non-dipper HR and higher BNP was associated with a higher incidence of CV events.</code> |
415
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
416
+ ```json
417
+ {
418
+ "scale": 20.0,
419
+ "similarity_fct": "cos_sim"
420
+ }
421
+ ```
422
+
423
+ ### Evaluation Dataset
424
+
425
+ #### Unnamed Dataset
426
+
427
+
428
+ * Size: 15,247 evaluation samples
429
+ * Columns: <code>anchor</code> and <code>positive</code>
430
+ * Approximate statistics based on the first 1000 samples:
431
+ | | anchor | positive |
432
+ |:--------|:----------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|
433
+ | type | string | string |
434
+ | details | <ul><li>min: 6 tokens</li><li>mean: 24.38 tokens</li><li>max: 46 tokens</li></ul> | <ul><li>min: 25 tokens</li><li>mean: 280.0 tokens</li><li>max: 866 tokens</li></ul> |
435
+ * Samples:
436
+ | anchor | positive |
437
+ |:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
438
+ | <code>What are the underlying mechanisms by which electroporation enhances the immunogenicity of low-dose DNA vaccines, and what implications does this have for vaccine design and efficacy?</code> | <code>Immunotherapeutic Effects of Different Doses of Mycobacterium tuberculosis ag85a/b DNA Vaccine Delivered by Electroporation. Background: Tuberculosis (TB) is a major global public health problem. New treatment methods on TB are urgently demanded. Methods: Ninety-six female BALB/c mice were challenged with 2×104 colony-forming units (CFUs) of MTB H37Rv through tail vein injection, then was treated with 10μg, 50μg, 100μg, and 200μg of Mycobacterium tuberculosis (MTB) ag85a/b chimeric DNA vaccine delivered by intramuscular injection (IM) and electroporation (EP), respectively. The immunotherapeutic effects were evaluated immunologically, bacteriologically, and pathologically. Results: Compared with the phosphate-buffered saline (PBS) group, the CD4+IFN-γ+ T cells% in whole blood from 200 μg DNA IM group and four DNA EP groups increased significantly (P<0.05), CD8+IFN-γ+ T cells% (in 200 μg DNA EP group), CD4+IL-4+ T cells% (50 μg DNA IM group) and CD8+IL-4+ T cells% (50 μg and 100 μg DNA IM group, 100 μg and 200 μg DNA EP group) increased significantly only in a few DNA groups (P< 0.05). The CD4+CD25+ Treg cells% decreased significantly in all DNA vaccine groups (P<0.01). Except for the 10 μg DNA IM group, the lung and spleen colony-forming units (CFUs) of the other seven DNA immunization groups decreased significantly (P<0.001, P<0.01), especially the 100 μg DNA IM group and 50 μg DNA EP group significantly reduced the pulmonary bacterial loads and lung lesions than the other DNA groups. Conclusions: An MTB ag85a/b chimeric DNA vaccine could induce Th1-type cellular immune reactions. DNA immunization by EP could improve the immunogenicity of the low-dose DNA vaccine, reduce DNA dose, and produce good immunotherapeutic effects on the mouse TB model, to provide the basis for the future human clinical trial of MTB ag85a/b chimeric DNA vaccine.</code> |
439
+ | <code>What is known about prostate cancer screening in the UK</code> | <code>Supporting informed decision making online in 20 minutes: an observational web-log study of a PSA test decision aid. BACKGROUND: Web-based decision aids are known to have an effect on knowledge, attitude, and behavior; important components of informed decision making. We know what decision aids achieve in randomized controlled trials (RCTs), but we still know very little about how they are used and how this relates to the informed decision making outcome measures. OBJECTIVE: To examine men's use of an online decision aid for prostate cancer screening using website transaction log files (web-logs), and to examine associations between usage and components of informed decision making. METHODS: We conducted an observational web-log analysis of users of an online decision aid, Prosdex. Men between 50 and 75 years of age were recruited for an associated RCT from 26 general practices across South Wales, United Kingdom. Men allocated to one arm of the RCT were included in the current study. Time and usage data were derived from website log files. Components of informed decision making were measured by an online questionnaire. RESULTS: Available for analysis were 82 web-logs. Overall, there was large variation in the use of Prosdex. The mean total time spent on the site was 20 minutes. The mean number of pages accessed was 32 (SD 21) out of a possible 60 pages. Significant associations were found between increased usage and increased knowledge (Spearman rank correlation [rho] = 0.69, P < .01), between increased usage and less favorable attitude towards PSA testing (rho = -0.52, P < .01), and between increased usage and reduced intention to undergo PSA testing (rho = -0.44, P < .01). A bimodal distribution identified two types of user: low access and high access users. CONCLUSIONS: Increased usage of Prosdex leads to more informed decision making, the key aim of the UK Prostate Cancer Risk Management Programme. However, developers realistically have roughly 20 minutes to provide useful information that will support informed decision making when the patient uses a web-based interface. Future decision aids need to be developed with this limitation in mind. We recommend that web-log analysis should be an integral part of online decision aid development and analysis. TRIAL REGISTRATION: ISRCTN48473735; http://www.controlled-trials.com/ISRCTN48473735 (Archived by WebCite at http://www.webcitation.org/5pqeF89tS).</code> |
440
+ | <code>How does early life adiposity influence long-term cardiovascular health, and what are the implications for prevention and intervention strategies?</code> | <code>Adiposity is associated with endothelial activation in healthy 2-3 year-old children. Adiposity is associated with C-reactive protein level in healthy 2-3 year-old children and with other markers of endothelial activation in adults, but data are lacking in very young children. Data from 491 healthy Hispanic children were analyzed. Mean age was 2.7 years (SD 0.5, range 2-3 years); mean body mass index (BMI) was 17.2 kg/m2 (SD 1.9) among boys and 17.1 kg/m2 (SD 2.1) among girls. E-selectin level was associated with BMI (R = 0.11; p < 0.02), ponderal index (p < 0.02), waist circumference (p = 0.02), fasting insulin (p < 0.02), and insulin resistance (p < or = 0.05); these associations remained significant after adjustment for age, sex and fasting glucose. sVCAM was also associated with BMI (R = 0.12; p < 0.05). These observations indicate that adiposity is associated with inflammation and endothelial activation in very early childhood.</code> |
441
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
442
+ ```json
443
+ {
444
+ "scale": 20.0,
445
+ "similarity_fct": "cos_sim"
446
+ }
447
+ ```
448
+
449
+ ### Training Hyperparameters
450
+ #### Non-Default Hyperparameters
451
+
452
+ - `eval_strategy`: epoch
453
+ - `per_device_train_batch_size`: 128
454
+ - `per_device_eval_batch_size`: 16
455
+ - `learning_rate`: 2e-05
456
+ - `weight_decay`: 0.01
457
+ - `num_train_epochs`: 5
458
+ - `warmup_ratio`: 0.1
459
+ - `fp16`: True
460
+ - `load_best_model_at_end`: True
461
+ - `resume_from_checkpoint`: True
462
+
463
+ #### All Hyperparameters
464
+ <details><summary>Click to expand</summary>
465
+
466
+ - `overwrite_output_dir`: False
467
+ - `do_predict`: False
468
+ - `eval_strategy`: epoch
469
+ - `prediction_loss_only`: True
470
+ - `per_device_train_batch_size`: 128
471
+ - `per_device_eval_batch_size`: 16
472
+ - `per_gpu_train_batch_size`: None
473
+ - `per_gpu_eval_batch_size`: None
474
+ - `gradient_accumulation_steps`: 1
475
+ - `eval_accumulation_steps`: None
476
+ - `torch_empty_cache_steps`: None
477
+ - `learning_rate`: 2e-05
478
+ - `weight_decay`: 0.01
479
+ - `adam_beta1`: 0.9
480
+ - `adam_beta2`: 0.999
481
+ - `adam_epsilon`: 1e-08
482
+ - `max_grad_norm`: 1.0
483
+ - `num_train_epochs`: 5
484
+ - `max_steps`: -1
485
+ - `lr_scheduler_type`: linear
486
+ - `lr_scheduler_kwargs`: {}
487
+ - `warmup_ratio`: 0.1
488
+ - `warmup_steps`: 0
489
+ - `log_level`: passive
490
+ - `log_level_replica`: warning
491
+ - `log_on_each_node`: True
492
+ - `logging_nan_inf_filter`: True
493
+ - `save_safetensors`: True
494
+ - `save_on_each_node`: False
495
+ - `save_only_model`: False
496
+ - `restore_callback_states_from_checkpoint`: False
497
+ - `no_cuda`: False
498
+ - `use_cpu`: False
499
+ - `use_mps_device`: False
500
+ - `seed`: 42
501
+ - `data_seed`: None
502
+ - `jit_mode_eval`: False
503
+ - `use_ipex`: False
504
+ - `bf16`: False
505
+ - `fp16`: True
506
+ - `fp16_opt_level`: O1
507
+ - `half_precision_backend`: auto
508
+ - `bf16_full_eval`: False
509
+ - `fp16_full_eval`: False
510
+ - `tf32`: None
511
+ - `local_rank`: 0
512
+ - `ddp_backend`: None
513
+ - `tpu_num_cores`: None
514
+ - `tpu_metrics_debug`: False
515
+ - `debug`: []
516
+ - `dataloader_drop_last`: False
517
+ - `dataloader_num_workers`: 0
518
+ - `dataloader_prefetch_factor`: None
519
+ - `past_index`: -1
520
+ - `disable_tqdm`: False
521
+ - `remove_unused_columns`: True
522
+ - `label_names`: None
523
+ - `load_best_model_at_end`: True
524
+ - `ignore_data_skip`: False
525
+ - `fsdp`: []
526
+ - `fsdp_min_num_params`: 0
527
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
528
+ - `fsdp_transformer_layer_cls_to_wrap`: None
529
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
530
+ - `deepspeed`: None
531
+ - `label_smoothing_factor`: 0.0
532
+ - `optim`: adamw_torch
533
+ - `optim_args`: None
534
+ - `adafactor`: False
535
+ - `group_by_length`: False
536
+ - `length_column_name`: length
537
+ - `ddp_find_unused_parameters`: None
538
+ - `ddp_bucket_cap_mb`: None
539
+ - `ddp_broadcast_buffers`: False
540
+ - `dataloader_pin_memory`: True
541
+ - `dataloader_persistent_workers`: False
542
+ - `skip_memory_metrics`: True
543
+ - `use_legacy_prediction_loop`: False
544
+ - `push_to_hub`: False
545
+ - `resume_from_checkpoint`: True
546
+ - `hub_model_id`: None
547
+ - `hub_strategy`: every_save
548
+ - `hub_private_repo`: False
549
+ - `hub_always_push`: False
550
+ - `gradient_checkpointing`: False
551
+ - `gradient_checkpointing_kwargs`: None
552
+ - `include_inputs_for_metrics`: False
553
+ - `eval_do_concat_batches`: True
554
+ - `fp16_backend`: auto
555
+ - `push_to_hub_model_id`: None
556
+ - `push_to_hub_organization`: None
557
+ - `mp_parameters`:
558
+ - `auto_find_batch_size`: False
559
+ - `full_determinism`: False
560
+ - `torchdynamo`: None
561
+ - `ray_scope`: last
562
+ - `ddp_timeout`: 1800
563
+ - `torch_compile`: False
564
+ - `torch_compile_backend`: None
565
+ - `torch_compile_mode`: None
566
+ - `dispatch_batches`: None
567
+ - `split_batches`: None
568
+ - `include_tokens_per_second`: False
569
+ - `include_num_input_tokens_seen`: False
570
+ - `neftune_noise_alpha`: None
571
+ - `optim_target_modules`: None
572
+ - `batch_eval_metrics`: False
573
+ - `eval_on_start`: False
574
+ - `eval_use_gather_object`: False
575
+ - `batch_sampler`: batch_sampler
576
+ - `multi_dataset_batch_sampler`: proportional
577
+
578
+ </details>
579
+
580
+ ### Training Logs
581
+ | Epoch | Step | Training Loss | Validation Loss |
582
+ |:------:|:----:|:-------------:|:---------------:|
583
+ | 0.0932 | 100 | 0.3536 | - |
584
+ | 0.1864 | 200 | 0.227 | - |
585
+ | 0.2796 | 300 | 0.1599 | - |
586
+ | 0.3728 | 400 | 0.1448 | - |
587
+ | 0.4660 | 500 | 0.1276 | - |
588
+ | 0.5592 | 600 | 0.1187 | - |
589
+ | 0.6524 | 700 | 0.1191 | - |
590
+ | 0.7456 | 800 | 0.1082 | - |
591
+ | 0.8388 | 900 | 0.1026 | - |
592
+ | 0.9320 | 1000 | 0.0991 | - |
593
+ | 1.0 | 1073 | - | 0.0138 |
594
+ | 1.0252 | 1100 | 0.089 | - |
595
+ | 1.1184 | 1200 | 0.0759 | - |
596
+ | 1.2116 | 1300 | 0.0726 | - |
597
+ | 1.3048 | 1400 | 0.075 | - |
598
+ | 1.3979 | 1500 | 0.0732 | - |
599
+ | 1.4911 | 1600 | 0.07 | - |
600
+ | 1.5843 | 1700 | 0.0706 | - |
601
+ | 1.6775 | 1800 | 0.0708 | - |
602
+ | 1.7707 | 1900 | 0.0691 | - |
603
+ | 1.8639 | 2000 | 0.0713 | - |
604
+ | 1.9571 | 2100 | 0.0626 | - |
605
+ | 2.0 | 2146 | - | 0.0115 |
606
+ | 2.0503 | 2200 | 0.0564 | - |
607
+ | 2.1435 | 2300 | 0.0547 | - |
608
+ | 2.2367 | 2400 | 0.052 | - |
609
+ | 2.3299 | 2500 | 0.0491 | - |
610
+ | 2.4231 | 2600 | 0.0542 | - |
611
+ | 2.5163 | 2700 | 0.0506 | - |
612
+ | 2.6095 | 2800 | 0.0508 | - |
613
+ | 2.7027 | 2900 | 0.0493 | - |
614
+ | 2.7959 | 3000 | 0.0537 | - |
615
+ | 2.8891 | 3100 | 0.0499 | - |
616
+ | 2.9823 | 3200 | 0.0488 | - |
617
+ | 3.0 | 3219 | - | 0.0101 |
618
+ | 3.0755 | 3300 | 0.0444 | - |
619
+ | 3.1687 | 3400 | 0.0433 | - |
620
+ | 3.2619 | 3500 | 0.0425 | - |
621
+ | 3.3551 | 3600 | 0.0412 | - |
622
+ | 3.4483 | 3700 | 0.0451 | - |
623
+ | 3.5415 | 3800 | 0.0433 | - |
624
+ | 3.6347 | 3900 | 0.0429 | - |
625
+ | 3.7279 | 4000 | 0.0423 | - |
626
+ | 3.8211 | 4100 | 0.0445 | - |
627
+ | 3.9143 | 4200 | 0.0407 | - |
628
+ | 4.0 | 4292 | - | 0.0099 |
629
+ | 4.0075 | 4300 | 0.0415 | - |
630
+ | 4.1007 | 4400 | 0.0371 | - |
631
+ | 4.1938 | 4500 | 0.0376 | - |
632
+ | 4.2870 | 4600 | 0.037 | - |
633
+ | 4.3802 | 4700 | 0.0388 | - |
634
+ | 4.4734 | 4800 | 0.0352 | - |
635
+ | 4.5666 | 4900 | 0.0367 | - |
636
+ | 4.6598 | 5000 | 0.0377 | - |
637
+ | 4.7530 | 5100 | 0.0384 | - |
638
+ | 4.8462 | 5200 | 0.0355 | - |
639
+ | 4.9394 | 5300 | 0.0415 | - |
640
+ | 5.0 | 5365 | - | 0.0098 |
641
+
642
+
643
+ ### Framework Versions
644
+ - Python: 3.12.2
645
+ - Sentence Transformers: 3.2.1
646
+ - Transformers: 4.44.2
647
+ - PyTorch: 2.5.0
648
+ - Accelerate: 1.0.1
649
+ - Datasets: 3.0.2
650
+ - Tokenizers: 0.19.1
651
+
652
+ ## Citation
653
+
654
+ ### BibTeX
655
+
656
+ #### Sentence Transformers
657
+ ```bibtex
658
+ @inproceedings{reimers-2019-sentence-bert,
659
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
660
+ author = "Reimers, Nils and Gurevych, Iryna",
661
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
662
+ month = "11",
663
+ year = "2019",
664
+ publisher = "Association for Computational Linguistics",
665
+ url = "https://arxiv.org/abs/1908.10084",
666
+ }
667
+ ```
668
+
669
+ #### MultipleNegativesRankingLoss
670
+ ```bibtex
671
+ @misc{henderson2017efficient,
672
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
673
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
674
+ year={2017},
675
+ eprint={1705.00652},
676
+ archivePrefix={arXiv},
677
+ primaryClass={cs.CL}
678
+ }
679
+ ```
680
+
681
+ <!--
682
+ ## Glossary
683
+
684
+ *Clearly define terms in order to be accessible across audiences.*
685
+ -->
686
+
687
+ <!--
688
+ ## Model Card Authors
689
+
690
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
691
+ -->
692
+
693
+ <!--
694
+ ## Model Card Contact
695
+
696
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
697
+ -->
added_tokens.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "[TEXT]": 32768,
3
+ "[YEAR_RANGE]": 32769
4
+ }
config.json ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "/data/aronow/pankaj/Embeddings/Bioformer-MNRL-finetuned/checkpoint-5365",
3
+ "architectures": [
4
+ "BertModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "classifier_dropout": null,
8
+ "hidden_act": "gelu",
9
+ "hidden_dropout_prob": 0.1,
10
+ "hidden_size": 384,
11
+ "initializer_range": 0.02,
12
+ "intermediate_size": 1536,
13
+ "layer_norm_eps": 1e-12,
14
+ "max_position_embeddings": 1024,
15
+ "model_type": "bert",
16
+ "num_attention_heads": 6,
17
+ "num_hidden_layers": 16,
18
+ "pad_token_id": 0,
19
+ "position_embedding_type": "absolute",
20
+ "torch_dtype": "float32",
21
+ "transformers_version": "4.45.2",
22
+ "type_vocab_size": 2,
23
+ "use_cache": true,
24
+ "vocab_size": 32770
25
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.1.1",
4
+ "transformers": "4.45.2",
5
+ "pytorch": "2.4.1+cu121"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": null
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a55f3a88611239d6d860ea7241e6c95705177bea3750b7dfc50f05d12df57201
3
+ size 166100216
modules.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ }
14
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 1024,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "additional_special_tokens": [
3
+ "[TEXT]",
4
+ "[YEAR_RANGE]"
5
+ ],
6
+ "cls_token": {
7
+ "content": "[CLS]",
8
+ "lstrip": false,
9
+ "normalized": false,
10
+ "rstrip": false,
11
+ "single_word": false
12
+ },
13
+ "mask_token": {
14
+ "content": "[MASK]",
15
+ "lstrip": false,
16
+ "normalized": false,
17
+ "rstrip": false,
18
+ "single_word": false
19
+ },
20
+ "pad_token": {
21
+ "content": "[PAD]",
22
+ "lstrip": false,
23
+ "normalized": false,
24
+ "rstrip": false,
25
+ "single_word": false
26
+ },
27
+ "sep_token": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false
33
+ },
34
+ "unk_token": {
35
+ "content": "[UNK]",
36
+ "lstrip": false,
37
+ "normalized": false,
38
+ "rstrip": false,
39
+ "single_word": false
40
+ }
41
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,84 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "[PAD]",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "100": {
12
+ "content": "[UNK]",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "101": {
20
+ "content": "[CLS]",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "102": {
28
+ "content": "[SEP]",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "103": {
36
+ "content": "[MASK]",
37
+ "lstrip": false,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ },
43
+ "32768": {
44
+ "content": "[TEXT]",
45
+ "lstrip": false,
46
+ "normalized": false,
47
+ "rstrip": false,
48
+ "single_word": false,
49
+ "special": true
50
+ },
51
+ "32769": {
52
+ "content": "[YEAR_RANGE]",
53
+ "lstrip": false,
54
+ "normalized": false,
55
+ "rstrip": false,
56
+ "single_word": false,
57
+ "special": true
58
+ }
59
+ },
60
+ "additional_special_tokens": [
61
+ "[TEXT]",
62
+ "[YEAR_RANGE]"
63
+ ],
64
+ "clean_up_tokenization_spaces": true,
65
+ "cls_token": "[CLS]",
66
+ "do_basic_tokenize": true,
67
+ "do_lower_case": false,
68
+ "mask_token": "[MASK]",
69
+ "max_length": 1024,
70
+ "model_max_length": 1024,
71
+ "never_split": null,
72
+ "pad_to_multiple_of": null,
73
+ "pad_token": "[PAD]",
74
+ "pad_token_type_id": 0,
75
+ "padding_side": "right",
76
+ "sep_token": "[SEP]",
77
+ "stride": 0,
78
+ "strip_accents": null,
79
+ "tokenize_chinese_chars": true,
80
+ "tokenizer_class": "BertTokenizer",
81
+ "truncation_side": "right",
82
+ "truncation_strategy": "longest_first",
83
+ "unk_token": "[UNK]"
84
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff