jordanpainter01
commited on
Commit
•
01ba8dc
1
Parent(s):
10c3f8f
Update README.md
Browse files
README.md
CHANGED
@@ -12,9 +12,8 @@ model-index:
|
|
12 |
- name: bertweet-base-finetuned-SARC-DS
|
13 |
results: []
|
14 |
---
|
15 |
-
|
16 |
-
|
17 |
-
should probably proofread and complete it, then remove this comment. -->
|
18 |
|
19 |
# bertweet-base-finetuned-SARC-DS
|
20 |
|
@@ -28,16 +27,13 @@ It achieves the following results on the evaluation set:
|
|
28 |
|
29 |
## Model description
|
30 |
|
31 |
-
The given description for BERTweet by VinAI is as follows:
|
32 |
BERTweet is the first public large-scale language model pre-trained for English Tweets. BERTweet is trained based on the RoBERTa pre-training procedure. The corpus used to pre-train BERTweet consists of 850M English Tweets (16B word tokens ~ 80GB), containing 845M Tweets streamed from 01/2012 to 08/2019 and 5M Tweets related to the COVID-19 pandemic.
|
33 |
-
|
34 |
-
## Intended uses & limitations
|
35 |
-
|
36 |
-
More information needed
|
37 |
|
38 |
## Training and evaluation data
|
39 |
|
40 |
-
|
41 |
|
42 |
## Training procedure
|
43 |
|
|
|
12 |
- name: bertweet-base-finetuned-SARC-DS
|
13 |
results: []
|
14 |
---
|
15 |
+
# Utilising Weak Supervision to Create S3D: A Sarcasm Annotated Dataset
|
16 |
+
This is the repository for the S3D dataset published at EMNLP 2022. The dataset can help build sarcasm detection models.
|
|
|
17 |
|
18 |
# bertweet-base-finetuned-SARC-DS
|
19 |
|
|
|
27 |
|
28 |
## Model description
|
29 |
|
30 |
+
The given description for BERTweet by VinAI is as follows: <br>
|
31 |
BERTweet is the first public large-scale language model pre-trained for English Tweets. BERTweet is trained based on the RoBERTa pre-training procedure. The corpus used to pre-train BERTweet consists of 850M English Tweets (16B word tokens ~ 80GB), containing 845M Tweets streamed from 01/2012 to 08/2019 and 5M Tweets related to the COVID-19 pandemic.
|
32 |
+
<br>
|
|
|
|
|
|
|
33 |
|
34 |
## Training and evaluation data
|
35 |
|
36 |
+
This [vinai/bertweet-base](https://huggingface.co/vinai/bertweet-base) model was finetuned on the [SARC](https://metatext.io/datasets/self-annotated-reddit-corpus-(sarc)) dataset. The dataset is intended to help build sarcasm detection models.
|
37 |
|
38 |
## Training procedure
|
39 |
|