m7n commited on
Commit
8446fde
1 Parent(s): c6effba

Add new SentenceTransformer model.

Browse files
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ tokenizer.json filter=lfs diff=lfs merge=lfs -text
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 1024,
3
+ "pooling_mode_cls_token": true,
4
+ "pooling_mode_mean_tokens": false,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,707 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: BAAI/bge-m3
3
+ datasets: []
4
+ language: []
5
+ library_name: sentence-transformers
6
+ metrics:
7
+ - cosine_accuracy
8
+ - dot_accuracy
9
+ - manhattan_accuracy
10
+ - euclidean_accuracy
11
+ - max_accuracy
12
+ pipeline_tag: sentence-similarity
13
+ tags:
14
+ - sentence-transformers
15
+ - sentence-similarity
16
+ - feature-extraction
17
+ - generated_from_trainer
18
+ - dataset_size:5000
19
+ - loss:TripletLoss
20
+ widget:
21
+ - source_sentence: 'now widely acknowledged that a study of scientific representation
22
+ should focus on the role of such imaginary entities in scientists'' reasoning.
23
+ However, the question is most of the time cast as follows: How can fictional or
24
+ abstract entities represent the phenomena? In this paper, I show that this question
25
+ is not well posed. First, I clarify the notion of representation, and I emphasise
26
+ the importance of what I call the "format" of a representation'
27
+ sentences:
28
+ - 'for the inferences agents can draw from it. Then, I show that the very same model
29
+ can be presented under different formats, which do not enable scientists to perform
30
+ the same inferences. Assuming that the main function of a representation is to
31
+ allow one to draw predictions and explanations of the phenomena by reasoning with
32
+ it, I conclude that imaginary models in abstracto are not used as representations:
33
+ scientists always reason with formatted'
34
+ - of representations that happen to arise in science. Empirical studies on ways
35
+ of representing used in science strongly suggest that a formal theory of representations
36
+ of
37
+ - 'some practitioners of Hypothetico-Deductivism suggest a methodology for the sciences
38
+ in which the ''best'' theory is, conversely, the most improbable on a priori grounds;
39
+ thus Popper in ''The Logic of Scientific Discovery'', Hutchinson, 1968, p. 142:
40
+ "Simplicity equals high prior improbability . . . Simple statements . . . are
41
+ to be prized . . . because they tell us more; because their empirical content
42
+ is greater; and because they are better testable". 5 This is due to, the fact
43
+ that the experimental law has its meaning independently given (Nagel note 3 above);
44
+ whereas "theoretical notions cannot be understood apart from the particular theory
45
+ that implicitly defines them". (Nagel, op. cit. p. 87); and consequently die with
46
+ the corresponding death of the parent theory. 6 "You will have noticed from this
47
+ formulation that it is not the accumulation of observations which I have in mind
48
+ when I speak of the growth of scientific knowledge, but the repeated overthrow
49
+ of scientific theories ... I can therefore gladly admit that falsificationists
50
+ like myself much prefer an attempt to solve an interesting problem by a bold conjecture,
51
+ even (and especially) if it soon turns out false, to any sequence of irrelevant
52
+ truisms. We prefer this because we believe that this is the way in which we can
53
+ learn from our mistakes; and that in finding out our conjecture was false, we
54
+ have learned much about the truth, and shall have got nearer the truth". (Popper
55
+ Conjectures & Refutations, pp. 215, 231). The Supposed'
56
+ - source_sentence: as you like"), many scholars deny that they enjoyed either positive
57
+ freedoms (in particular to speak free of interruption in the Assembly) or negative
58
+ freedoms, where the state could intervene as it wished, as
59
+ sentences:
60
+ - of prosody such as intonation, kinesic elements such as gestures, and paralinguistic
61
+ elements of expressive voice accompaniment for which the text provides both fewer
62
+ and less exact indications. More significant here are those paralinguistic, kinesic,
63
+ and even linguistic items for which the written text offers no notation whatsoever.'
64
+ Yet there are still other reasons that distinguish speech from language that are
65
+ less ARNOLD BERLEANT obvious but of more subtle presence. Something about speech
66
+ makes a claim on our attention; one cannot quite be indifferent to it. Speech
67
+ is, in essence, what phenomenologists call an intentional object, one which is
68
+ an object of our consciousness and toward which our consciousness is directed.
69
+ As such it exercises a peculiar but powerful attraction on us. There is, for example,
70
+ a challenge in giving a lecture. Because of the presence of a live audience that
71
+ is involved (we assume) in the presentation, we cannot ignore abrupt shifts in
72
+ ideas, weak transitions, lame inferences, strings of trivialities, or fuzzy ideas
73
+ which we might be able to get away with safely when writing. Words that have no
74
+ force by virtue of their sequence and their freshness drop lifeless from the lips
75
+ in embarrassment. This indeed signifies a basic difference between language in
76
+ general and its literary mode. Language is the inert material from which literature
77
+ is fashioned. Moreover, literature has the essential quality of speech and not
78
+ of the merely written word. Thus we can say that literature arises out of
79
+ - 'in the aftermath of the Persian wars. The ideal of freedom in these contexts
80
+ is the freedom of whole city-states, not of individuals. The final stage in Raaflaub''s
81
+ account of the concept of freedom is the emergence of a specifically democratic
82
+ conception of freedom, which he dates to the middle of the fifth century and the
83
+ rivalry between Athens and Sparta. The democratic conception of freedom, Raaflaub
84
+ argues, was at least in part an ideal for indi as well. In what follows I will
85
+ describe the key moments in Raaflaub''s history of freedom that illustrate its
86
+ collective character. Then I will discuss in detail Raaflaub''s account of the
87
+ democratic ideal of freedom and the freedom of individuals. Raaflaub argues that
88
+ the historical watershed that launches the concept of freedom into its central
89
+ role in Greek political life is the invasion of mainland Greece by Persia early
90
+ in the fifth century B.C. (chapter 3).4 This invasion presented an unprecedented
91
+ threat of foreign domination, sparking a newfound concern for the independence
92
+ and self-determination of Greek city-states. At the same time, the prolonged encounter
93
+ with a political culture the Greeks saw as tyrannical and slavish led them to
94
+ value anew the constitutions under which they lived: structured, law-governed,
95
+ and providing citizens some degree of participation in their governance. Thus
96
+ the Persian 4In what follows I reference chapters by topic, omitting chapter 1
97
+ (on methodological issues), chapter 2 (on the concept of freedom before the Persian
98
+ wars), and much of chapter'
99
+ - against Sokrates for his religious views. The current essay argues that in their
100
+ personal lives the Athenians were entirely free, except when speech or action
101
+ materially harmed the community. A second ideology
102
+ - source_sentence: 'the rest is only a development of this material. It is to them
103
+ we have to look for the subject matter of propositions. It is the task of the
104
+ theory of knowledge to find them and to understand their construction out of the
105
+ words or symbols. This task is very difficult, and Philosophy has hardly yet begun
106
+ to tackle it at some points. What method have we for tackling it ? The idea is
107
+ to express in an appropriate symbolism what in ordinary language leads to endless
108
+ misunderstandings. That is to say, where ordinary language disguises logical structure,
109
+ where it allows the formation of pseudopropositions, where it uses one term in
110
+ an infinity of different meanings, we must replace it by a symbolism which gives
111
+ a clear picture of the logical structure, excludes pseudopropositions, and uses
112
+ its terms unambiguously. Now we can only substitute a clear symbolism for the
113
+ unprecise one by inspecting the phenomena which we want to describe, thus trying
114
+ to understand their logical multiplicity. That is to say, we can only arrive at
115
+ a correct analysis by,what might be called, the logical investigation of the phenomena
116
+ themselves, i.e., in a certain sense a posteriori, and not by conjecturing about
117
+ a priori possibilities. One is often tempted to ask from an a priori standpoint:
118
+ What, after all, can be the only forms of atomic propositions, and to answer,
119
+ e.g., subject-predicate and relational propositions with two or more terms further,'
120
+ sentences:
121
+ - quantifier-free logics. In this paper, I will develop 64 normal modal semantic
122
+ tableau systems that can be extended by propositional quantifiers yielding 64
123
+ extended systems. All in all, we will investigate 128 different systems. I will
124
+ show how these systems can be used to prove some interesting theorems and I will
125
+ discuss Lewis's so-called existence postulate and some of its consequences. Finally,
126
+ I will prove
127
+ - 'perhaps, propositions relating predicates and relations to one another, and so
128
+ on. But this, I believe, is mere playing with words. An atomic form cauuot be
129
+ foreseen. And it would be surprising if the actual It 2 164 F. WITTGENSTEIN. phenomena
130
+ had nothing more to teach us about their structure. To such conjectures about
131
+ the structure of atomic propositions, we are led by our ordinary language, which
132
+ uses the subject-predicate and the relational form. But in this our language is
133
+ misleading: I will try to explain this by a simile. Let us imagine two parallel
134
+ planes, I and II. On plane I figures are drawn, say, ellipses and rectangles of
135
+ different sizes and shapes, and it is our task to produce images of these figures
136
+ on plane II. Then we can imagine two ways, amongst others, of doing this. We can,
137
+ first, lay down a law of projectionsay that of orthogonal projection or any other-and
138
+ then proceed to project all figures from I into II, according to this law. Or,
139
+ secondly, we could proceed thus: We lay down the rule that every ellipse on plane
140
+ I is to appear as a circle in plane II, and every rectangle as a square in II.
141
+ Such a way of representation may be convenient for us if for some reason we prefer
142
+ to draw only circles and squares on plane II. Of course, from these images the
143
+ exact shapes of the original figures on plane I cannot be immediately inferred.
144
+ We can only gather from them that the original was an ellipse'
145
+ - 'sentence can be said to be true precisely on the same conditions as in Wittgenstein''s
146
+ picture theory. For instance, a two-place relation is represented in a first-order
147
+ language by a relation symbol (say " R{ , )") with two argument-places, which
148
+ thus in effect defines a twoplace relation in the language (viz. the one holding
149
+ between the symbols which fill these argument-places) corresponding to the relation
150
+ it represents in the world. An atomic sentence in which the two argument places
151
+ have been filled by individual constants, say "(Ra,b)", is true if and only if
152
+ the relation represented by "R" holds between the individuals represented by "a"
153
+ and "b" (in this order). But this happens precisely when the linguistic relation
154
+ defined by "R" obtains between the symbols "a" and "6", that is, precisely when
155
+ the sentence "R(a,b)" is a true picture of the entities represented by "R" , "
156
+ a ", and "b" in Wittgenstein''s sence. 2 Wolfgang Stegmüller, " Eine modelltheoretische
157
+ Präzisierung der wittgensteinischen Bildtheorie ", Notre Dame Journal of Formal
158
+ Logic , vol. 7 (1966), pp. 181-195. 3 Erik Stenius Wittgenstein s Tractatus :
159
+ A Critical Exposition of Its Main Lines of Thought , Blackwell, Oxford, 1960.
160
+ 228 Jaakko Hintikka It seems to me highly significant that the closest Wittgenstein
161
+ comes in the Tractatus to giving us an actual example of an elementary proposition
162
+ is to present a relational formula of formal logic. In fact, in Tractatus 3.1432
163
+ Wittgenstein comes rather close to saying what I just said, and in 4.012 he says
164
+ in effect that the picture'
165
+ - source_sentence: 'for nouns: as pronouns refer to previously introduced nouns, so
166
+ prosentences like that''s true inherit their content from previously introduced
167
+ sentences. This kind of deflationism concerning the use of true (especially in
168
+ Brandom''s version) is an explanation in terms of anaphora; the prosentence depends
169
+ anaphorically on the sentence providing its content. A relevant implication of
170
+ this theory is that true is not understood as a predicate and that truth is not
171
+ a property. Primitivism, defended'
172
+ sentences:
173
+ - 'that by refusing to enter the state, anarchists prevent the state from performing
174
+ its legislative, executive and judicial functions, which are necessary in order
175
+ to have a minimal level of order and security. In doing so, anarchists expose
176
+ those living next to them to the dangers of the state of nature, thereby posing
177
+ an unjust threat. But since we all have a natural duty not to pose unjust threats
178
+ to others, anarchists and would-be independents have a duty to leave the state
179
+ of nature and enter the state.13 This duty correlates to a claim-right possessed
180
+ by those living next to them, who also have a right to act in self-defence in
181
+ order to enforce this obligation. This is what justifies the state''s right to
182
+ rule. Of course the idea of justifying state legitimacy by appealing to natural
183
+ duties is not new: John Rawls, Jeremy Waldron, Allen Buchanan, Christopher Wellman
184
+ and Thomas Christiano all defend versions of this view.14 But traditional natural
185
+ duty theories of legitimacy all ground political authority in positive duties
186
+ to help others or, more generally, to realize justice. The problem with these
187
+ views, however, is that the very existence of positive duties of this sort is
188
+ denied by many libertarians, who typically argue that all we have is negative
189
+ duties not to harm others. Moreover, even those 12 A further problem for Simmons
190
+ is that typically citizens do not regard the benefits provided by the state as
191
+ the products of a cooperative scheme. I find this objection less compelling, but
192
+ I cannot discuss it here.'
193
+ - 'by Frege, Moore, and Davidson, is associated with two ideas: (1) that truth is
194
+ a primitive and central trait of our conceptual system and (2) that truth, as
195
+ such, cannot be defined. This second claim can be called negative primitivism,
196
+ and it especially points out the facts about the indefinability of truth generally
197
+ advocated by primitivists. In what follows, a connection is established between
198
+ the deflationist''s rejection of the predicate and of the property and facts (and
199
+ primitivist'
200
+ - according to which Tr([(p]) and cp should be intersubstitutible in all (non-opaque)
201
+ contexts. Here I am focusing on the property of truth, but the issue could be
202
+ also raised for the concept (Asay 2013; forthcoming). It is not clear to me, though,
203
+ in what measure substantiality at the concept level without further specifications
204
+ is incompatible with deflationism, so I will put the question mostly aside. Perhaps,
205
+ the real problem here would be the thesis that a truth predicate only serves a
206
+ logical role. A thoroughly discussion, however, would be required. © 2014 The
207
+ Author dialéctica © 2014 Editorial Board of dialéctica 524 Andrea Strollo such
208
+ a thing as a single, determined deflationary conception;8 instead, we have a family9
209
+ of different approaches that share some motivations and philosophical suggestions.10
210
+ However, at least in its modern evolution, convergences are strong enough to allow
211
+ for a general treatment in terms of such an idealized theory, at least in many
212
+ cases.11 It is worth noting that such general tenets do not have the equal importance
213
+ in characterizing an approach as deflationary. In fact, if Tarskian biconditionals
214
+ and the thesis of the logical role could be abandoned without rejecting deflationism,12
215
+ or even combined with different philosophical conceptions,13 things are rather
216
+ different for the claim that truth is an unsubstantial property, if a property
217
+ at all. Such an (anti)metaphysical project of deflating the nature beside the
218
+ role of truth is the leitmotiv of the entire history of
219
+ - source_sentence: met, Peter van Inwagen, a libertarian, concludes that free will
220
+ is a mystery. Recently, the Mind Argument has drawn a number of criticisms. Here
221
+ I seek to add to its woes. Quite apart from its other problems, I argue, the Mind
222
+ Argument does a poor job of isolating the important concern for libertarians that
223
+ it raises. Once this concern has been clarified,
224
+ sentences:
225
+ - however, another argument serves to renew the challenge. The Assimilation Argument
226
+ challenges libertarians to explain how ostensible exercises of free will are relevantly
227
+ different from other causally undetermined outcomes, outcomes that nobody would
228
+ count as exercises of free will. In particular, libertarians must explain how
229
+ agents can have the power
230
+ - contended that the Assimilation Argument is unsound. Here I defend the Assimilation
231
+ Argument and the Rollback Argument, a second challenge to libertarianism that
232
+ Franklin rejects. My aim in doing so is to underscore the force of these challenges,
233
+ and thereby to resist what appears to be an emerging trend in
234
+ - '(p. 79), it must be capable of generating a theory of knowledge (p. 80), and
235
+ it must harmonise with a plausible account of the way contentful states explain
236
+ behaviour, which, the author argues, is non-causal (p. 139). The import of some
237
+ of these requirements becomes more clear in Part III, where the theory begins
238
+ to unfold. It is in fact a version of behaviourism, but with the saving grace
239
+ that it allows mental content to be found in behaviour, rather than trying to
240
+ reduce it away (p. 146). It also appears to find value to be intrinsic to behaviour.
241
+ To be in an intentional state-to have a belief or desire, or to possess a concept-is
242
+ to be "liable" to exhibit behavioural responses intrinsically assessable in evaluative
243
+ terms. Two values are articulated; T and D. The former is analogous to truth,
244
+ except that it applies to behaviour rather than beliefs; the latter is akin to
245
+ desirability, and is intended for assessments of a response''s conative import.
246
+ In fact, however, the theory progresses very little. The focus starts out as one
247
+ would expect: on belief and desire, the intentional states most relevant for explaining
248
+ behaviour. But a problem crops up immediately. The author''s account of it is
249
+ complicated by his views on opacity (pp. 180 ff.), but here is a simplified version.
250
+ Ideally, it seems, the treatment of belief would say that a person believes that
251
+ p just in case she is liable to make responses intrinsically assessable with respect
252
+ to T as "good'
253
+ model-index:
254
+ - name: SentenceTransformer based on BAAI/bge-m3
255
+ results:
256
+ - task:
257
+ type: triplet
258
+ name: Triplet
259
+ dataset:
260
+ name: m3
261
+ type: m3
262
+ metrics:
263
+ - type: cosine_accuracy
264
+ value: 0.978
265
+ name: Cosine Accuracy
266
+ - type: dot_accuracy
267
+ value: 0.022
268
+ name: Dot Accuracy
269
+ - type: manhattan_accuracy
270
+ value: 0.974
271
+ name: Manhattan Accuracy
272
+ - type: euclidean_accuracy
273
+ value: 0.978
274
+ name: Euclidean Accuracy
275
+ - type: max_accuracy
276
+ value: 0.978
277
+ name: Max Accuracy
278
+ - task:
279
+ type: triplet
280
+ name: Triplet
281
+ dataset:
282
+ name: all nli test
283
+ type: all-nli-test
284
+ metrics:
285
+ - type: cosine_accuracy
286
+ value: 0.9755
287
+ name: Cosine Accuracy
288
+ - type: dot_accuracy
289
+ value: 0.0245
290
+ name: Dot Accuracy
291
+ - type: manhattan_accuracy
292
+ value: 0.976
293
+ name: Manhattan Accuracy
294
+ - type: euclidean_accuracy
295
+ value: 0.9755
296
+ name: Euclidean Accuracy
297
+ - type: max_accuracy
298
+ value: 0.976
299
+ name: Max Accuracy
300
+ ---
301
+
302
+ # SentenceTransformer based on BAAI/bge-m3
303
+
304
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3). It maps sentences & paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
305
+
306
+ ## Model Details
307
+
308
+ ### Model Description
309
+ - **Model Type:** Sentence Transformer
310
+ - **Base model:** [BAAI/bge-m3](https://huggingface.co/BAAI/bge-m3) <!-- at revision 5617a9f61b028005a4858fdac845db406aefb181 -->
311
+ - **Maximum Sequence Length:** 8192 tokens
312
+ - **Output Dimensionality:** 1024 tokens
313
+ - **Similarity Function:** Cosine Similarity
314
+ <!-- - **Training Dataset:** Unknown -->
315
+ <!-- - **Language:** Unknown -->
316
+ <!-- - **License:** Unknown -->
317
+
318
+ ### Model Sources
319
+
320
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
321
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
322
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
323
+
324
+ ### Full Model Architecture
325
+
326
+ ```
327
+ SentenceTransformer(
328
+ (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: XLMRobertaModel
329
+ (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
330
+ (2): Normalize()
331
+ )
332
+ ```
333
+
334
+ ## Usage
335
+
336
+ ### Direct Usage (Sentence Transformers)
337
+
338
+ First install the Sentence Transformers library:
339
+
340
+ ```bash
341
+ pip install -U sentence-transformers
342
+ ```
343
+
344
+ Then you can load this model and run inference.
345
+ ```python
346
+ from sentence_transformers import SentenceTransformer
347
+
348
+ # Download from the 🤗 Hub
349
+ model = SentenceTransformer("m7n/bge-m3-philosophy-triplets_v3")
350
+ # Run inference
351
+ sentences = [
352
+ 'met, Peter van Inwagen, a libertarian, concludes that free will is a mystery. Recently, the Mind Argument has drawn a number of criticisms. Here I seek to add to its woes. Quite apart from its other problems, I argue, the Mind Argument does a poor job of isolating the important concern for libertarians that it raises. Once this concern has been clarified,',
353
+ 'however, another argument serves to renew the challenge. The Assimilation Argument challenges libertarians to explain how ostensible exercises of free will are relevantly different from other causally undetermined outcomes, outcomes that nobody would count as exercises of free will. In particular, libertarians must explain how agents can have the power',
354
+ 'contended that the Assimilation Argument is unsound. Here I defend the Assimilation Argument and the Rollback Argument, a second challenge to libertarianism that Franklin rejects. My aim in doing so is to underscore the force of these challenges, and thereby to resist what appears to be an emerging trend in',
355
+ ]
356
+ embeddings = model.encode(sentences)
357
+ print(embeddings.shape)
358
+ # [3, 1024]
359
+
360
+ # Get the similarity scores for the embeddings
361
+ similarities = model.similarity(embeddings, embeddings)
362
+ print(similarities.shape)
363
+ # [3, 3]
364
+ ```
365
+
366
+ <!--
367
+ ### Direct Usage (Transformers)
368
+
369
+ <details><summary>Click to see the direct usage in Transformers</summary>
370
+
371
+ </details>
372
+ -->
373
+
374
+ <!--
375
+ ### Downstream Usage (Sentence Transformers)
376
+
377
+ You can finetune this model on your own dataset.
378
+
379
+ <details><summary>Click to expand</summary>
380
+
381
+ </details>
382
+ -->
383
+
384
+ <!--
385
+ ### Out-of-Scope Use
386
+
387
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
388
+ -->
389
+
390
+ ## Evaluation
391
+
392
+ ### Metrics
393
+
394
+ #### Triplet
395
+ * Dataset: `m3`
396
+ * Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
397
+
398
+ | Metric | Value |
399
+ |:-------------------|:----------|
400
+ | cosine_accuracy | 0.978 |
401
+ | dot_accuracy | 0.022 |
402
+ | manhattan_accuracy | 0.974 |
403
+ | euclidean_accuracy | 0.978 |
404
+ | **max_accuracy** | **0.978** |
405
+
406
+ #### Triplet
407
+ * Dataset: `all-nli-test`
408
+ * Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
409
+
410
+ | Metric | Value |
411
+ |:-------------------|:----------|
412
+ | cosine_accuracy | 0.9755 |
413
+ | dot_accuracy | 0.0245 |
414
+ | manhattan_accuracy | 0.976 |
415
+ | euclidean_accuracy | 0.9755 |
416
+ | **max_accuracy** | **0.976** |
417
+
418
+ <!--
419
+ ## Bias, Risks and Limitations
420
+
421
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
422
+ -->
423
+
424
+ <!--
425
+ ### Recommendations
426
+
427
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
428
+ -->
429
+
430
+ ## Training Details
431
+
432
+ ### Training Dataset
433
+
434
+ #### Unnamed Dataset
435
+
436
+
437
+ * Size: 5,000 training samples
438
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
439
+ * Approximate statistics based on the first 1000 samples:
440
+ | | anchor | positive | negative |
441
+ |:--------|:-------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
442
+ | type | string | string | string |
443
+ | details | <ul><li>min: 17 tokens</li><li>mean: 288.26 tokens</li><li>max: 566 tokens</li></ul> | <ul><li>min: 21 tokens</li><li>mean: 288.3 tokens</li><li>max: 565 tokens</li></ul> | <ul><li>min: 22 tokens</li><li>mean: 301.96 tokens</li><li>max: 577 tokens</li></ul> |
444
+ * Samples:
445
+ | anchor | positive | negative |
446
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
447
+ | <code>Reform (Boston: Auburn House, 1982). 7 See, e.g., K. Polk, 'Rape Reform and Criminal Justice Proces;ing', 31 Crime & Delinquency 191 (1985); W. Loh, 'The Impact of Common Law and Reform Rape Statutes on Prosecution: An Empirical Study', 55 Wash. L. Rev. 543, 552-54 (1980); J. Marsh, A. Geist & N. Caplan, Rape and the Limits of Law Reform 65 (1982). 8 See S. Estrich, Real Rape (Cambridge: Harvard University Press, 1987). 7 Susan Estrich throw their hands up in the air, depending on whether they saw themselves as feminists, or wanted me to. But nearly all of them said the same thing then, and most of them continue to repeat it even today: juries won't convict. You may think it's rape, and I may even think it's rape, but the public doesn't, and we've got too many real rapes in this office to waste our time on cases where the woman will eventually back out, and the jury would never convict. It hasn't been easy to dissuade them. Certainly, what went on in most law school classrooms in the nineteen-eighties, where rape was so politicized that one could hardly debate it, provides no answer to the real world concerns, or at least justifications, that animate so many prosecutors' offices. When I started teaching law in 1981, there was still nothing in the casebooks on rape, so I put my own materials together. Hard cases make good classes. I found better examples than the three English fools,</code> | <code>more sympathetic men: doctors, not drunken sailors. And women who didn't scream quite so much, or fight back quite so vigorously. Two great classes, as these things go. Most of the men I knew were reluctant to teach rape, for reasons that at first I didn't quite understand. But I convinced one to try the next year, and lent him my materials, and watched first with amusement and then with dismay when a delegation of women students came to visit him before class to complain that his (my) materials were sexist and inappropriate. I'd hoped those days were over. After all, all the casebooks now have sections on rape, and everyone covers it somehow. But only last week, a colleague who is one of the best criminal law professors in the country told me of being visited by a similar delegation in 1991, a decade later. They didn't like the way he taught rape. There were victims in the class. They didn't want to hear "the other side". In their view, there was no "other side". Nothing to debate. Lord knows, I'm sympathetic to those who have been victimized and who feel that they have nothing to learn from a debate about rape. Maybe they don't. But the rest of us, the rest of their classmates, the rest of the country, surely do. Silencing the debate only reenforces and reaffirms the status quo that many of us are trying to change. The debate I am interested in is not between the radical feminists</code> | <code>relations, thwarting possibilities of knowing the specific harms particular acts of rape enact well enough to render intelligible their far-reaching social-political-moral implications. Taking my point of departure from Debra Bergoffen's call for 'a new epistemology of rape', I</code> |
448
+ | <code>framed, because the key premissthe thesis of universal causal determinismis, in the domain of human behaviour, an unjustified conjecture based on over-simplified, under-informed explanatory models. Kant's semantics of singular cognitive reference (explained herein), which stands independently of his Transcendental Idealism, justifies and emphasises a quadruple distinction between causal description, causal ascription (predication), (approximately) true causal ascription (accurate predication) and cognitively justified causal ascription. Contemporary causal theories of mind, of action</code> | <code>or of meaning do not suffice for causal ascription, and so cannot suffice for causal predication, and hence cannot justify causal determinism about human behaviour. More generally, the principle of universal causal determinism is a regulative principle governing causal inquiry and was so formulated by LaPlace. Only successful, sufficient causal explanation of particular events provides for causal knowledge of those events. Such knowledge we lack in the domain of human behaviour. Rational belief, including scientific belief, requires apportioning belief to justifying evidence; all else</code> | <code>etc.; but causal explanation, in its advanced stages, is not content with such vague statements of uniformities. The uniformities which are sought are the ones which can be correlated with precise, numerical relations. A third characteristic which has often been associated with causality is the concept of determinism or necessity. This third trait of causality has been violently criticized. From Hume to Russell it has been a common contention that necessity is a fiction and that causality merely means invariable connection of uniformity of processes. For the present, without analysing the objections raised to the necessity or determinism of causal relations, I shall limit myself to brief comments on the meaning of necessity, and on the specific sense of necessity in the causal relation. Though necessity, or determinism, should be a fiction or a myth, it is certainly not a meaningless concept. Hume, who presents the most effective objections to the concept of necessity, was quite certain that the concept had a meaning, since otherwise his criticisms would have been of no importance. What then is necessity? Briefly, necessity refers to deductive systems where from self-evident, postu' S. Alexander, Space, Time and Deity, vol. i, p. 288. 320 CAUSALITY lated, or "convenient" concepts other concepts are derived. The clearest illustration of the notion of necessity is the concept of classes where the implicative relation is that of inclusion, or that of from species to genus. Causal relations are deterministic in the sense that cause and effect contain</code> |
449
+ | <code>presuppositions. In a nutshell view of aspects of the history of science the presuppositions, as part of the relevant historical context, operate within the scientific enterprise. Science used to be views as positivist and final; yet, the logic of methodology, from E.A. Burtt to T.S. Kuhn and Karl Popper</code> | <code>show that presuppositions are part of the scientific research. Particularly the pioneering work of the archeologist and philosopher R.G. Collingwood and the seminal thinker S.N. Nasr, who opened the doors for the inquiry of the role of science within Islamic Civilization, are highlighted. We show that the</code> | <code>continues to evolve, it seems most probable that it will someday diverge again from its seeming congruity with the teachings of the Qur'an. An even sharper critique arises from the unexpressed as sumption that there exists a standard of knowledge on a par with, if not actually higher than, Allah's revealed truth to which that revelation must be submitted for its validation. Thus, while this attempt to claim unity (or at least non-contradiction) between modern science and Islamic revelation does not require any sort of limitation on science or scientific research, it remains unacceptable to the majority of modern scientists, as well as to many Muslim thinkers, because of the implications of its assumptions.8 The first assumption, that science has reached its peak of development, is unacceptable to science. The second assumption, that religious dogma should, or even can, be judged by the tenets of modern science, is rejected by many Muslim thinkers. Scientific Knowledge Needs an Islamic Interpretation In response to the unacceptable and often Positivistic analysis of Bucaille, the Islamic intellectual community has adopted more radical al ternatives, such as Ismail al-Faruqi's call for an Islamization of all knowledge, including modern scientific knowledge.9 His argument grows out of a perception that Islamic culture, which once led the Mediterranean world, is increasingly threatened by secularized and individualistic West ern values?a threat that is as much cultural as it is political. Faruqi's plea for an Islamization of knowledge, then, should be read in the context</code> |
450
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
451
+ ```json
452
+ {
453
+ "distance_metric": "TripletDistanceMetric.COSINE",
454
+ "triplet_margin": 0.05
455
+ }
456
+ ```
457
+
458
+ ### Evaluation Dataset
459
+
460
+ #### Unnamed Dataset
461
+
462
+
463
+ * Size: 500 evaluation samples
464
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
465
+ * Approximate statistics based on the first 1000 samples:
466
+ | | anchor | positive | negative |
467
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
468
+ | type | string | string | string |
469
+ | details | <ul><li>min: 29 tokens</li><li>mean: 288.01 tokens</li><li>max: 646 tokens</li></ul> | <ul><li>min: 26 tokens</li><li>mean: 286.17 tokens</li><li>max: 499 tokens</li></ul> | <ul><li>min: 16 tokens</li><li>mean: 289.98 tokens</li><li>max: 544 tokens</li></ul> |
470
+ * Samples:
471
+ | anchor | positive | negative |
472
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
473
+ | <code>public have now before them, the evidences of natural religion, the evidences of revealed religion, and an account of the duties that result from both" (Paley 1829, v-vi ). Without the arguments of natural theology, revelation and religious morality could not be discussed. The evidence of human reason and of the empirical study of the natural world, was the basis for establishing the existence and character of a deity. Only after these were established could one consider other sources of knowledge. Without the arguments available from natural theology that there exists a single unified, beneficent, and omnipresent deity, there is no warrant for accepting that the deity is capable of revelation, that revelation ought to be a trustworthy basis for knowledge, or that there exists a moral order or set of moral duties. The mere existence of a deity was insufficient for Paley. As a work of theology, the most important conclusion is not only that God merely exists, but also that characteristics about God can be known and inferences about the proper relationship of the individual and society to God can be drawn. As a result, much of the early chapters of Natural Theology did not put forth an argument for the existence of God, but instead excluded bad arguments for the existence of God. In order to understand what Paley's argument is, it is paramount to also understand what it is not. 58 ADAM R. SHAPIRO With the recent restoration of Paley's work, several historians and philosophers of science have begun this</code> | <code>re-examination. Sander Gliboff noted that "Paley devoted a chapter to comparing rival hypotheses unfavorably with his own and eliminating them from consideration one by one" (Gliboff 2000). For Gliboff, doing so was essential for establishing the legitimacy of the central argument of the Natural Theology. The central argument, which is what Paley himself referred to as "the argument" in the opening chapters of the book, is an inference that can be formulated as the claim that if an object has a "purpose," then it has a designer. In analyzing this, Gliboff agreed with Eliot Sober's 1993 assessment that Paley's central argument was based upon "inference to the best explanation" (Sober 2000, 31). Sober ascribed to Paley two possible explanations for apparent purposeful objects, design and random chance. The explanation that purposeful objects have their origins in a designer is pitted against the other possible explanation for the appearance of purpose in an object and is ruled the most likely. Gliboff showed that these were not the only contenders and claims that Paley refutes several alternatives that make up a "pool" of possible explanations. The inference to a designer is judged the most likely. If Natural Theology was in fact based upon an inference to the best explanation, then it becomes clear why a Darwinian conclusion might be seen to uproot Paley. If the workings of nature can be seen to generate purpose without the apparent intentional workings of a designer, then alternate explanations are strengthened and</code> | <code>and intellectual sides of the stratagem. In response to Erasmus Darwin's evolutionary theories. Paley asserted that the divinely designed architecture of nature had remained unchanged since the creation. But the more he emphasized the preordained nature of providence, its effectuation through mechanical dispositions, the less room there appeared to be for particular interventions. Section 2 concentrates on Paley's efforts to reconcile this model of a law-governed, mechanical universe, with the belief</code> |
474
+ | <code>the two lines of work from Sections 2 and 3 together: the realization that there is no in principle difference between functional and natural kinds regarding their epistemic importance for scientific reasoning and explanation provides clues how the problem for HPC theory that I pointed out in Section 2 can be resolved, and HPC theory can be turned into a full‐fledged theory of natural kindhood able to cover kinds in the special sciences as well as the less controversial natural kinds. 1. Brigandt (2009) recently reached a similar conclusion on the basis of different considerations. 2. A Problem for HPC Theory. In the philosophical literature on the topic, two distinct ways of thinking about natural kinds can be found. On the one hand, there is the essentialist tradition that, broadly taken, understands natural kinds as groupings of things according to their natures, their intrinsic properties or causal capacities, their microscopic structures, and so on. On the other hand, there is the more recent tradition that understands natural kinds as groupings of things over which we can make reliable inductions. That these lines of work really are quite distinct can be seen from the way in which they conceive of the problem of natural kinds.2 The former line of work conceives of the problem as a metaphysical problem, that is, as the question, What sorts of things are there in the world? As Brian Ellis put it in a recent defense of essentialism, “membership of a natural kind is</code> | <code>decided by nature, not by us” (2001, 19). The latter line of work, in contrast, sees it as foremost a question in epistemology, that is, as the question, Which ways of grouping things are best suited to help us make inferences and to explain phenomena? Boyd, for example, asserted that “it is a truism that the philosophical theory of natural kinds is about how classificatory schemes come to contribute to the epistemic reliability of inductive and explanatory practices” (1999a, 146; also 1999b, 69). On this view, kind membership is decided more by us than by nature. With respect to the explanation‐grounding capacity of natural kinds, the two lines of work provide different answers that run into different kinds of problems. From the perspective of the essentialist tradition it should be no miracle that natural kinds ground reliable inferences and explanations. If there is a definitive, objective way in which the world is made up of kinds of things, then clearly any explanation of a given phenomenon should ultimately make reference to some of these objectively existing kinds of things. Once we have achieved an inventory of the various kinds that exist in the world and have an account of their metaphysics, we have a theory of the kinds that can feature in our explanations. The problem, however, is that we do not have any direct access to the world that would allow us to compile the required inventory of the world’s furniture. Our best bet at obtaining such an inventory is to</code> | <code>There is, however, a selection of strong papers in the collection that do address key topics in the contemporary debate concerning natural kinds— too many, in fact, to review them all here (and hope to do them justice). As such, I shall mention a selection that struck me as of particular interest, since each addresses one of the perennial themes in the natural kinds debate, namely the alleged epistemic utility of natural kinds, and their role in explanatory and inductive reasoning. I'll start with Chapter 2 of the collection, a paper written by Peter Godfrey-Smith. In this chapter, Godfrey-Smith discusses inductive inferences and their associated justification. Inductive inferences are a perennial philo sophical topic, cropping up in both the philosophy of science and epistemol ogy. In the context of the topic of the edited collection, there are apparent inductive benefits bestowed by recognizing and classifying objects into natural kinds, which makes induction a central topic to the broader debate concerning natural kinds. Godfrey-Smith identifies what he takes to be two conflated types of induct ive inference and goes on to sketch a philosophical justification for each, re spectively, which has not been used to justify what philosophers have historically referred to as 'inductive' inferences. The approach employed builds upon Frank Jackson's response to Nelson Goodman's classic 'grue' example, from his new riddle of induction. Godfrey-Smith sketches a method of inference that is built upon statistical models of population sampling, where there are rules telling us how the dis tribution</code> |
475
+ | <code>model, can be chosen in several ways from the set of all elementary experiences. In case (ii) it is obviously conceivable that the mental stream, like a loaf of bread, can be cut into slices in more than one way. At the same time (in ? 67) that Carnap is inclined to consider elementary experiences as abstractions of some kind (as mere 'places' in the stream), he maintains that they are what is "primarily given" both psychologically and epistemologically. The set of all elementary experiences occurring in one's own mental life is symbolically designated by erl. When I wish to show that the owner is a person A, I shall write erlA. When speaking, in what follows, of elementary experiences I shall always assume that we are dealing with experiences of one and the same person unless the contrary is explicitly stated. 1.3. Quality Points The visual field that is a part of my present elementary experience is assumed in the Aufbau to be a two-dimensional spatial configuration of color points. Each color point is characterized by its two 'local signs', which together determine its place in the visual field, and by its hue, saturation, and brightness, which together determine its place in the color solid.9 In this sense, the color points occurring in the elementary expe riences can be thought of as points in a five-dimensional space (the word 'space' here understood in an abstract sense), which may be called the HOW CARNAP BUILT THE WORLD IN</code> | <code>1928 341 color space.10 The color points occurring in a given elementary experience will then be a certain subset of the color space. In that space any two points whatsoever have a mutual distance. (How the distance is to be measured is left unexplained in the Aufbau.) If the distance is less than or equal to a certain standard distance, the two points are said to be similar (?hnlich).11 This notion of similarity thus involves the notion of a standard distance. Carnap seems to consider this distance as small, but otherwise he says nothing about it. (The question as to how the choice of standard distance affects the constructions of the Aufbau is disturbing and impossible to answer.) If the points p and p' are similar and if p occurs in the elementary experience e and/?' in e9 then e and e' are said to be part similar (teil?hnlich).,12 Color points are a species of the genus quality points. On one level of thought, in one sense, Carnap considers an elementary experience as a complex of a number of quality points of various kinds. One kind are visual color points, another auditive qualities, a third olfactory, etc. He is also prepared to recognize emotive, volitive, and intellectual qualities (quality points). The word 'sense class' is used to denote such a kind of quality.13 For each sense class, just as for vision, notions of distance and standard distance are postulated. Two points from the same sense class are said to be similar if</code> | <code>presentational of worldly properties, but as blank sensations, extrinsically endowed with representational powers in virtue of standing as 'natural signs' for their normal causes.1 On that view, both colour perceptions and colour beliefs normally correspond with reality, in the simple sense of being true. Still, the view retains a strong case to be regarded a form of anti-realism. This has nothing to do with a failure of correspondence in the sense of plain truth; nor is it to be characterized as a failure of any other kind of 'correspondence'. It is rather to be brought out by means of a contrast in directions of explanation with respect to the individuation of perceptions of colour and the colours of things. Compare the case of shapes. The most basic distinctions are between squareness and circularity, say, as properties of things in the world. Having first identified which property squareness is, we may then identify perceptions of squareness as those which present something as having that property. On the current view concerning colour, the direction of explanation is the reverse. The most basic distinctions are between experiences of redness and experiences of greenness, say, conceived as blank sensations. Having made such distinctions, we may then define a property redness which applies to mind-independent objects, as that of being disposed to produce those experiences red-type ones or as the property of having whatever physical constitution actually grounds that disposition. Thus, although, the colours are perfectly real, in the sense that representations</code> |
476
+ * Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
477
+ ```json
478
+ {
479
+ "distance_metric": "TripletDistanceMetric.COSINE",
480
+ "triplet_margin": 0.05
481
+ }
482
+ ```
483
+
484
+ ### Training Hyperparameters
485
+ #### Non-Default Hyperparameters
486
+
487
+ - `eval_strategy`: steps
488
+ - `per_device_train_batch_size`: 4
489
+ - `per_device_eval_batch_size`: 4
490
+ - `learning_rate`: 1e-05
491
+ - `warmup_ratio`: 0.1
492
+ - `batch_sampler`: no_duplicates
493
+
494
+ #### All Hyperparameters
495
+ <details><summary>Click to expand</summary>
496
+
497
+ - `overwrite_output_dir`: False
498
+ - `do_predict`: False
499
+ - `eval_strategy`: steps
500
+ - `prediction_loss_only`: True
501
+ - `per_device_train_batch_size`: 4
502
+ - `per_device_eval_batch_size`: 4
503
+ - `per_gpu_train_batch_size`: None
504
+ - `per_gpu_eval_batch_size`: None
505
+ - `gradient_accumulation_steps`: 1
506
+ - `eval_accumulation_steps`: None
507
+ - `learning_rate`: 1e-05
508
+ - `weight_decay`: 0.0
509
+ - `adam_beta1`: 0.9
510
+ - `adam_beta2`: 0.999
511
+ - `adam_epsilon`: 1e-08
512
+ - `max_grad_norm`: 1.0
513
+ - `num_train_epochs`: 3
514
+ - `max_steps`: -1
515
+ - `lr_scheduler_type`: linear
516
+ - `lr_scheduler_kwargs`: {}
517
+ - `warmup_ratio`: 0.1
518
+ - `warmup_steps`: 0
519
+ - `log_level`: passive
520
+ - `log_level_replica`: warning
521
+ - `log_on_each_node`: True
522
+ - `logging_nan_inf_filter`: True
523
+ - `save_safetensors`: True
524
+ - `save_on_each_node`: False
525
+ - `save_only_model`: False
526
+ - `restore_callback_states_from_checkpoint`: False
527
+ - `no_cuda`: False
528
+ - `use_cpu`: False
529
+ - `use_mps_device`: False
530
+ - `seed`: 42
531
+ - `data_seed`: None
532
+ - `jit_mode_eval`: False
533
+ - `use_ipex`: False
534
+ - `bf16`: False
535
+ - `fp16`: False
536
+ - `fp16_opt_level`: O1
537
+ - `half_precision_backend`: auto
538
+ - `bf16_full_eval`: False
539
+ - `fp16_full_eval`: False
540
+ - `tf32`: None
541
+ - `local_rank`: 0
542
+ - `ddp_backend`: None
543
+ - `tpu_num_cores`: None
544
+ - `tpu_metrics_debug`: False
545
+ - `debug`: []
546
+ - `dataloader_drop_last`: False
547
+ - `dataloader_num_workers`: 0
548
+ - `dataloader_prefetch_factor`: None
549
+ - `past_index`: -1
550
+ - `disable_tqdm`: False
551
+ - `remove_unused_columns`: True
552
+ - `label_names`: None
553
+ - `load_best_model_at_end`: False
554
+ - `ignore_data_skip`: False
555
+ - `fsdp`: []
556
+ - `fsdp_min_num_params`: 0
557
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
558
+ - `fsdp_transformer_layer_cls_to_wrap`: None
559
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
560
+ - `deepspeed`: None
561
+ - `label_smoothing_factor`: 0.0
562
+ - `optim`: adamw_torch
563
+ - `optim_args`: None
564
+ - `adafactor`: False
565
+ - `group_by_length`: False
566
+ - `length_column_name`: length
567
+ - `ddp_find_unused_parameters`: None
568
+ - `ddp_bucket_cap_mb`: None
569
+ - `ddp_broadcast_buffers`: False
570
+ - `dataloader_pin_memory`: True
571
+ - `dataloader_persistent_workers`: False
572
+ - `skip_memory_metrics`: True
573
+ - `use_legacy_prediction_loop`: False
574
+ - `push_to_hub`: False
575
+ - `resume_from_checkpoint`: None
576
+ - `hub_model_id`: None
577
+ - `hub_strategy`: every_save
578
+ - `hub_private_repo`: False
579
+ - `hub_always_push`: False
580
+ - `gradient_checkpointing`: False
581
+ - `gradient_checkpointing_kwargs`: None
582
+ - `include_inputs_for_metrics`: False
583
+ - `eval_do_concat_batches`: True
584
+ - `fp16_backend`: auto
585
+ - `push_to_hub_model_id`: None
586
+ - `push_to_hub_organization`: None
587
+ - `mp_parameters`:
588
+ - `auto_find_batch_size`: False
589
+ - `full_determinism`: False
590
+ - `torchdynamo`: None
591
+ - `ray_scope`: last
592
+ - `ddp_timeout`: 1800
593
+ - `torch_compile`: False
594
+ - `torch_compile_backend`: None
595
+ - `torch_compile_mode`: None
596
+ - `dispatch_batches`: None
597
+ - `split_batches`: None
598
+ - `include_tokens_per_second`: False
599
+ - `include_num_input_tokens_seen`: False
600
+ - `neftune_noise_alpha`: None
601
+ - `optim_target_modules`: None
602
+ - `batch_eval_metrics`: False
603
+ - `eval_on_start`: False
604
+ - `batch_sampler`: no_duplicates
605
+ - `multi_dataset_batch_sampler`: proportional
606
+
607
+ </details>
608
+
609
+ ### Training Logs
610
+ | Epoch | Step | Training Loss | loss | all-nli-test_max_accuracy | m3_max_accuracy |
611
+ |:------:|:----:|:-------------:|:------:|:-------------------------:|:---------------:|
612
+ | 0 | 0 | - | - | - | 0.91 |
613
+ | 0.08 | 100 | 0.0083 | 0.0101 | - | 0.916 |
614
+ | 0.16 | 200 | 0.0094 | 0.0084 | - | 0.932 |
615
+ | 0.24 | 300 | 0.0075 | 0.0070 | - | 0.942 |
616
+ | 0.32 | 400 | 0.0085 | 0.0065 | - | 0.952 |
617
+ | 0.4 | 500 | 0.0068 | 0.0058 | - | 0.956 |
618
+ | 0.48 | 600 | 0.0064 | 0.0057 | - | 0.958 |
619
+ | 0.56 | 700 | 0.0063 | 0.0051 | - | 0.964 |
620
+ | 0.64 | 800 | 0.0064 | 0.0047 | - | 0.974 |
621
+ | 0.72 | 900 | 0.0049 | 0.0044 | - | 0.974 |
622
+ | 0.8 | 1000 | 0.0057 | 0.0043 | - | 0.966 |
623
+ | 0.88 | 1100 | 0.0042 | 0.0054 | - | 0.96 |
624
+ | 0.96 | 1200 | 0.0037 | 0.0045 | - | 0.972 |
625
+ | 1.04 | 1300 | 0.006 | 0.0056 | - | 0.962 |
626
+ | 1.12 | 1400 | 0.0043 | 0.0053 | - | 0.96 |
627
+ | 1.2 | 1500 | 0.0026 | 0.0045 | - | 0.966 |
628
+ | 1.28 | 1600 | 0.001 | 0.0046 | - | 0.968 |
629
+ | 1.3600 | 1700 | 0.0012 | 0.0045 | - | 0.962 |
630
+ | 1.44 | 1800 | 0.0007 | 0.0042 | - | 0.968 |
631
+ | 1.52 | 1900 | 0.0006 | 0.0044 | - | 0.968 |
632
+ | 1.6 | 2000 | 0.0013 | 0.0040 | - | 0.97 |
633
+ | 1.6800 | 2100 | 0.0006 | 0.0038 | - | 0.974 |
634
+ | 1.76 | 2200 | 0.0011 | 0.0036 | - | 0.97 |
635
+ | 1.8400 | 2300 | 0.0012 | 0.0036 | - | 0.966 |
636
+ | 1.92 | 2400 | 0.001 | 0.0038 | - | 0.968 |
637
+ | 2.0 | 2500 | 0.0009 | 0.0038 | - | 0.974 |
638
+ | 2.08 | 2600 | 0.0006 | 0.0037 | - | 0.976 |
639
+ | 2.16 | 2700 | 0.0006 | 0.0037 | - | 0.98 |
640
+ | 2.24 | 2800 | 0.0004 | 0.0035 | - | 0.978 |
641
+ | 2.32 | 2900 | 0.0003 | 0.0034 | - | 0.978 |
642
+ | 2.4 | 3000 | 0.0003 | 0.0034 | - | 0.978 |
643
+ | 2.48 | 3100 | 0.0001 | 0.0033 | - | 0.978 |
644
+ | 2.56 | 3200 | 0.0003 | 0.0034 | - | 0.978 |
645
+ | 2.64 | 3300 | 0.0003 | 0.0034 | - | 0.976 |
646
+ | 2.7200 | 3400 | 0.0001 | 0.0033 | - | 0.976 |
647
+ | 2.8 | 3500 | 0.0003 | 0.0033 | - | 0.976 |
648
+ | 2.88 | 3600 | 0.0003 | 0.0033 | - | 0.976 |
649
+ | 2.96 | 3700 | 0.0003 | 0.0033 | - | 0.978 |
650
+ | 3.0 | 3750 | - | - | 0.976 | - |
651
+
652
+
653
+ ### Framework Versions
654
+ - Python: 3.10.12
655
+ - Sentence Transformers: 3.0.1
656
+ - Transformers: 4.42.4
657
+ - PyTorch: 2.3.1+cu121
658
+ - Accelerate: 0.32.1
659
+ - Datasets: 2.21.0
660
+ - Tokenizers: 0.19.1
661
+
662
+ ## Citation
663
+
664
+ ### BibTeX
665
+
666
+ #### Sentence Transformers
667
+ ```bibtex
668
+ @inproceedings{reimers-2019-sentence-bert,
669
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
670
+ author = "Reimers, Nils and Gurevych, Iryna",
671
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
672
+ month = "11",
673
+ year = "2019",
674
+ publisher = "Association for Computational Linguistics",
675
+ url = "https://arxiv.org/abs/1908.10084",
676
+ }
677
+ ```
678
+
679
+ #### TripletLoss
680
+ ```bibtex
681
+ @misc{hermans2017defense,
682
+ title={In Defense of the Triplet Loss for Person Re-Identification},
683
+ author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
684
+ year={2017},
685
+ eprint={1703.07737},
686
+ archivePrefix={arXiv},
687
+ primaryClass={cs.CV}
688
+ }
689
+ ```
690
+
691
+ <!--
692
+ ## Glossary
693
+
694
+ *Clearly define terms in order to be accessible across audiences.*
695
+ -->
696
+
697
+ <!--
698
+ ## Model Card Authors
699
+
700
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
701
+ -->
702
+
703
+ <!--
704
+ ## Model Card Contact
705
+
706
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
707
+ -->
config.json ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "BAAI/bge-m3",
3
+ "architectures": [
4
+ "XLMRobertaModel"
5
+ ],
6
+ "attention_probs_dropout_prob": 0.1,
7
+ "bos_token_id": 0,
8
+ "classifier_dropout": null,
9
+ "eos_token_id": 2,
10
+ "hidden_act": "gelu",
11
+ "hidden_dropout_prob": 0.1,
12
+ "hidden_size": 1024,
13
+ "initializer_range": 0.02,
14
+ "intermediate_size": 4096,
15
+ "layer_norm_eps": 1e-05,
16
+ "max_position_embeddings": 8194,
17
+ "model_type": "xlm-roberta",
18
+ "num_attention_heads": 16,
19
+ "num_hidden_layers": 24,
20
+ "output_past": true,
21
+ "pad_token_id": 1,
22
+ "position_embedding_type": "absolute",
23
+ "torch_dtype": "float32",
24
+ "transformers_version": "4.42.4",
25
+ "type_vocab_size": 1,
26
+ "use_cache": true,
27
+ "vocab_size": 250002
28
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "3.0.1",
4
+ "transformers": "4.42.4",
5
+ "pytorch": "2.3.1+cu121"
6
+ },
7
+ "prompts": {},
8
+ "default_prompt_name": null,
9
+ "similarity_fn_name": null
10
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d8c6c878abb1eb7f07d4023a74fb2f66f8f5d8f9fed3b4dbeab62036c9ef86df
3
+ size 2271064456
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 8192,
3
+ "do_lower_case": false
4
+ }
sentencepiece.bpe.model ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cfc8146abe2a0488e9e2a0c56de7952f7c11ab059eca145a0a727afce0db2865
3
+ size 5069051
special_tokens_map.json ADDED
@@ -0,0 +1,51 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token": {
3
+ "content": "<s>",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "cls_token": {
10
+ "content": "<s>",
11
+ "lstrip": false,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "eos_token": {
17
+ "content": "</s>",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "mask_token": {
24
+ "content": "<mask>",
25
+ "lstrip": true,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "pad_token": {
31
+ "content": "<pad>",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ },
37
+ "sep_token": {
38
+ "content": "</s>",
39
+ "lstrip": false,
40
+ "normalized": false,
41
+ "rstrip": false,
42
+ "single_word": false
43
+ },
44
+ "unk_token": {
45
+ "content": "<unk>",
46
+ "lstrip": false,
47
+ "normalized": false,
48
+ "rstrip": false,
49
+ "single_word": false
50
+ }
51
+ }
tokenizer.json ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e4f7e21bec3fb0044ca0bb2d50eb5d4d8c596273c422baef84466d2c73748b9c
3
+ size 17083053
tokenizer_config.json ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "<s>",
5
+ "lstrip": false,
6
+ "normalized": false,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": true
10
+ },
11
+ "1": {
12
+ "content": "<pad>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "2": {
20
+ "content": "</s>",
21
+ "lstrip": false,
22
+ "normalized": false,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": true
26
+ },
27
+ "3": {
28
+ "content": "<unk>",
29
+ "lstrip": false,
30
+ "normalized": false,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": true
34
+ },
35
+ "250001": {
36
+ "content": "<mask>",
37
+ "lstrip": true,
38
+ "normalized": false,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": true
42
+ }
43
+ },
44
+ "bos_token": "<s>",
45
+ "clean_up_tokenization_spaces": true,
46
+ "cls_token": "<s>",
47
+ "eos_token": "</s>",
48
+ "mask_token": "<mask>",
49
+ "model_max_length": 8192,
50
+ "pad_token": "<pad>",
51
+ "sep_token": "</s>",
52
+ "sp_model_kwargs": {},
53
+ "tokenizer_class": "XLMRobertaTokenizer",
54
+ "unk_token": "<unk>"
55
+ }