README.md · anilguven/electra_tr_turkish

metadata

license: mit
datasets:
  - anilguven/turkish_news_dataset
language:
  - tr
metrics:
  - accuracy
  - f1
tags:
  - electra
  - news
  - classification
  - text

Information

This model was developed/finetuned for news classification task for the Turkish Language. This model was finetuned via news dataset. This dataset contains 7 classes: economy, magazine, sport, politics, technology, health, and events.

LABEL_0: economy
LABEL_1: magazine
LABEL_2: health
LABEL_3: politics
LABEL_4: sports
LABEL_5: technology
LABEL_6: events

Model Sources

Dataset: https://huggingface.co/datasets/anilguven/turkish_news_dataset
Paper: peer review (Springer)
Finetuned from model:: https://huggingface.co/dbmdz/electra-base-turkish-cased-discriminator

Preprocessing

You must apply removing stopwords, stemming, or lemmatization process for Turkish.

Results

Accuracy: %97.619
F1-score: %97.617

Citation

BibTeX: Peer review process