jaygala24
/

distilroberta-base-finetuned-fake-news-english

Text Classification

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

jaygala24 commited on Apr 2, 2022

Commit

9a55d3d

•

1 Parent(s): a056fe2

Update README.md

Files changed (1) hide show

README.md +8 -6

README.md CHANGED Viewed

@@ -26,17 +26,19 @@ It achieves the following results on the evaluation set:
 - Recall: 1.0
 - Auc: 0.9997
-## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure

 - Recall: 1.0
 - Auc: 0.9997
 ## Intended uses & limitations
+The model may not work with the articles over 512 tokens after preprocessing as the model's context is restricted to a maximum of 512 tokens in the sequence.
 ## Training and evaluation data
+The [fake-and-real news](https://www.kaggle.com/datasets/clmentbisaillon/fake-and-real-news-dataset) dataset contains a total of 44,898 annotated articles with 21,417 real and 23,481 fake. The dataset was stratified split into train, validation, and test subsets with a proportion of 60:20:20 respectively. The model was finetuned on train subset and evaluated on validation and test subsets.
+| Split      | # examples |
+|:----------:|:----------:|
+| train      | 17959      |
+| validation | 13469      |
+| test       | 13470      |
 ## Training procedure