SimoneAstarita
commited on
Commit
•
6aa86af
1
Parent(s):
f789c03
Update README.md
Browse files
README.md
CHANGED
@@ -79,7 +79,7 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [S
|
|
79 |
- **Maximum Sequence Length:** 512 tokens
|
80 |
- **Output Dimensionality:** 384 tokens
|
81 |
- **Similarity Function:** Cosine Similarity
|
82 |
-
|
83 |
- **Language:** en
|
84 |
- **License:** apache-2.0
|
85 |
|
@@ -171,6 +171,8 @@ You can finetune this model on your own dataset.
|
|
171 |
|
172 |
### Training Dataset
|
173 |
|
|
|
|
|
174 |
#### Unnamed Dataset
|
175 |
|
176 |
|
@@ -326,17 +328,6 @@ You can finetune this model on your own dataset.
|
|
326 |
|
327 |
| Epoch | Step | Training Loss |
|
328 |
|:------:|:-----:|:-------------:|
|
329 |
-
| 0.0077 | 100 | 0.4784 |
|
330 |
-
| 0.0154 | 200 | 0.2415 |
|
331 |
-
| 0.0231 | 300 | 0.0424 |
|
332 |
-
| 0.0307 | 400 | 0.021 |
|
333 |
-
| 0.0384 | 500 | 0.0149 |
|
334 |
-
| 0.0461 | 600 | 0.0081 |
|
335 |
-
| 0.0538 | 700 | 0.0084 |
|
336 |
-
| 0.0615 | 800 | 0.0067 |
|
337 |
-
| 0.0692 | 900 | 0.0034 |
|
338 |
-
| 0.0769 | 1000 | 0.0025 |
|
339 |
-
| 0.0846 | 1100 | 0.0016 |
|
340 |
| 0.0077 | 100 | 0.0025 |
|
341 |
| 0.0154 | 200 | 0.0032 |
|
342 |
| 0.0231 | 300 | 0.0026 |
|
@@ -768,6 +759,8 @@ You can finetune this model on your own dataset.
|
|
768 |
}
|
769 |
```
|
770 |
|
|
|
|
|
771 |
<!--
|
772 |
## Glossary
|
773 |
|
|
|
79 |
- **Maximum Sequence Length:** 512 tokens
|
80 |
- **Output Dimensionality:** 384 tokens
|
81 |
- **Similarity Function:** Cosine Similarity
|
82 |
+
- **Training Dataset:** scraped astronomy papers at the NLP for Space Science workshop.
|
83 |
- **Language:** en
|
84 |
- **License:** apache-2.0
|
85 |
|
|
|
171 |
|
172 |
### Training Dataset
|
173 |
|
174 |
+
The dataset is made of scrapes papers in astronomy, including abstract, introduction and conclusions. They are divided into sentences using nklt. We then duplicate them and train using the same senrence for positive and anchor. We are using SimSCE.
|
175 |
+
|
176 |
#### Unnamed Dataset
|
177 |
|
178 |
|
|
|
328 |
|
329 |
| Epoch | Step | Training Loss |
|
330 |
|:------:|:-----:|:-------------:|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
331 |
| 0.0077 | 100 | 0.0025 |
|
332 |
| 0.0154 | 200 | 0.0032 |
|
333 |
| 0.0231 | 300 | 0.0026 |
|
|
|
759 |
}
|
760 |
```
|
761 |
|
762 |
+
#Add SimSCE reference
|
763 |
+
|
764 |
<!--
|
765 |
## Glossary
|
766 |
|