Update README.md
Browse files
README.md
CHANGED
@@ -38,22 +38,15 @@ Abstraction Level: The model tends to be more extractive than abstractive in its
|
|
38 |
## Training and evaluation data
|
39 |
|
40 |
|
41 |
-
|
42 |
|
43 |
-
Source:
|
44 |
-
Size: Approximately
|
45 |
-
Time Range: 2007-
|
46 |
Language: English
|
47 |
-
Content:
|
48 |
-
|
49 |
-
|
50 |
-
Academic Articles Dataset:
|
51 |
-
|
52 |
-
Source: arXiv and PubMed Open Access Subset
|
53 |
-
Size: Approximately 150,000 articles
|
54 |
-
Time Range: 2010-2022
|
55 |
-
Language: English
|
56 |
-
Content: Research papers from various scientific fields including physics, mathematics, computer science, and biomedical sciences
|
57 |
|
58 |
|
59 |
Pre-processing Steps:
|
|
|
38 |
## Training and evaluation data
|
39 |
|
40 |
|
41 |
+
Dataset:
|
42 |
|
43 |
+
Source: PARANMT-50M
|
44 |
+
Size: Approximately 50M
|
45 |
+
Time Range: 2007-2017
|
46 |
Language: English
|
47 |
+
Content: more than 50 million English-English
|
48 |
+
sentential paraphrase pairs
|
49 |
+
https://arxiv.org/pdf/1711.05732v2
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
|
51 |
|
52 |
Pre-processing Steps:
|