anilguven commited on
Commit
6c71ae7
1 Parent(s): 6f19e3d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -3
README.md CHANGED
@@ -1,3 +1,42 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - anilguven/turkish_news_dataset
5
+ language:
6
+ - tr
7
+ metrics:
8
+ - accuracy
9
+ - f1
10
+ tags:
11
+ - electra
12
+ - news
13
+ - classification
14
+ - text
15
+ ---
16
+
17
+ ### Information
18
+ This model was developed/finetuned for news classification task for the Turkish Language. This model was finetuned via news dataset. This dataset contains 7 classes: economy, magazine, sport, politics, technology, health, and events.
19
+
20
+ - LABEL_0: economy
21
+ - LABEL_1: magazine
22
+ - LABEL_2: health
23
+ - LABEL_3: politics
24
+ - LABEL_4: sports
25
+ - LABEL_5: technology
26
+ - LABEL_6: events
27
+
28
+ ### Model Sources
29
+ - **Dataset:** https://huggingface.co/datasets/anilguven/turkish_news_dataset
30
+ - **Paper:** peer review (Springer)
31
+ - **Finetuned from model::** https://huggingface.co/dbmdz/electra-base-turkish-cased-discriminator
32
+
33
+ ### Preprocessing
34
+ You must apply removing stopwords, stemming, or lemmatization process for Turkish.
35
+
36
+ ### Results
37
+ - Accuracy: %97.619
38
+ - F1-score: %97.617
39
+
40
+ ### Citation
41
+ BibTeX:
42
+ Peer review process