SimoneAstarita commited on
Commit
6aa86af
1 Parent(s): f789c03

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -12
README.md CHANGED
@@ -79,7 +79,7 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [S
79
  - **Maximum Sequence Length:** 512 tokens
80
  - **Output Dimensionality:** 384 tokens
81
  - **Similarity Function:** Cosine Similarity
82
- <!-- - **Training Dataset:** Unknown -->
83
  - **Language:** en
84
  - **License:** apache-2.0
85
 
@@ -171,6 +171,8 @@ You can finetune this model on your own dataset.
171
 
172
  ### Training Dataset
173
 
 
 
174
  #### Unnamed Dataset
175
 
176
 
@@ -326,17 +328,6 @@ You can finetune this model on your own dataset.
326
 
327
  | Epoch | Step | Training Loss |
328
  |:------:|:-----:|:-------------:|
329
- | 0.0077 | 100 | 0.4784 |
330
- | 0.0154 | 200 | 0.2415 |
331
- | 0.0231 | 300 | 0.0424 |
332
- | 0.0307 | 400 | 0.021 |
333
- | 0.0384 | 500 | 0.0149 |
334
- | 0.0461 | 600 | 0.0081 |
335
- | 0.0538 | 700 | 0.0084 |
336
- | 0.0615 | 800 | 0.0067 |
337
- | 0.0692 | 900 | 0.0034 |
338
- | 0.0769 | 1000 | 0.0025 |
339
- | 0.0846 | 1100 | 0.0016 |
340
  | 0.0077 | 100 | 0.0025 |
341
  | 0.0154 | 200 | 0.0032 |
342
  | 0.0231 | 300 | 0.0026 |
@@ -768,6 +759,8 @@ You can finetune this model on your own dataset.
768
  }
769
  ```
770
 
 
 
771
  <!--
772
  ## Glossary
773
 
 
79
  - **Maximum Sequence Length:** 512 tokens
80
  - **Output Dimensionality:** 384 tokens
81
  - **Similarity Function:** Cosine Similarity
82
+ - **Training Dataset:** scraped astronomy papers at the NLP for Space Science workshop.
83
  - **Language:** en
84
  - **License:** apache-2.0
85
 
 
171
 
172
  ### Training Dataset
173
 
174
+ The dataset is made of scrapes papers in astronomy, including abstract, introduction and conclusions. They are divided into sentences using nklt. We then duplicate them and train using the same senrence for positive and anchor. We are using SimSCE.
175
+
176
  #### Unnamed Dataset
177
 
178
 
 
328
 
329
  | Epoch | Step | Training Loss |
330
  |:------:|:-----:|:-------------:|
 
 
 
 
 
 
 
 
 
 
 
331
  | 0.0077 | 100 | 0.0025 |
332
  | 0.0154 | 200 | 0.0032 |
333
  | 0.0231 | 300 | 0.0026 |
 
759
  }
760
  ```
761
 
762
+ #Add SimSCE reference
763
+
764
  <!--
765
  ## Glossary
766