---
language:
- ku
metrics:
- perplexity
pipeline_tag: fill-mask
---

# KurdBERT

- Type: BERT-based transformer language model
- Description: KurdBERT is a pre-trained language model for the Kurdish languages (Sorani and Kurmanji). It can be used for NLP tasks such as text classification and named entity recognition.

## How to use

```python
# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("fill-mask", model="language-ml-lab/KurdBert")
```
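
Once created, the pipeline can be called on a sentence that contains the tokenizer's mask token. The following is a minimal sketch of one way to wrap that call. The helpers `insert_mask`, `summarize`, and `fill_mask`, as well as the `{mask}` placeholder, are hypothetical names introduced for illustration and are not part of the model or of `transformers`. It also assumes the tokenizer defines a mask token, which is expected for a fill-mask model but is not stated in this card.

```python
from typing import Any


def insert_mask(template: str, mask_token: str, placeholder: str = "{mask}") -> str:
    """Replace the single placeholder in `template` with the tokenizer's mask token."""
    if template.count(placeholder) != 1:
        raise ValueError(f"template must contain exactly one {placeholder!r}")
    return template.replace(placeholder, mask_token)


def summarize(predictions: list[dict[str, Any]]) -> list[tuple[str, float]]:
    """Reduce fill-mask pipeline output to (token, rounded score) pairs."""
    return [(p["token_str"], round(p["score"], 4)) for p in predictions]


def fill_mask(
    template: str,
    model_name: str = "language-ml-lab/KurdBert",
    top_k: int = 5,
) -> list[tuple[str, float]]:
    """Run the fill-mask pipeline on a template with one {mask} placeholder."""
    # Imported here so the helpers above work without transformers installed.
    from transformers import pipeline

    pipe = pipeline("fill-mask", model=model_name)
    # Read the mask token from the tokenizer instead of hard-coding "[MASK]".
    text = insert_mask(template, pipe.tokenizer.mask_token)
    return summarize(pipe(text, top_k=top_k))
```

Calling `fill_mask` with a Kurdish sentence in which one word is replaced by `{mask}` returns up to `top_k` candidate tokens with their scores. The first call downloads the model weights from the Hub.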

```python
# Load model directly
from transformers import AutoTokenizer, AutoModelForMaskedLM

tokenizer = AutoTokenizer.from_pretrained("language-ml-lab/KurdBert")
model = AutoModelForMaskedLM.from_pretrained("language-ml-lab/KurdBert")
```
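
With the tokenizer and model loaded directly, masked-token prediction can also be done by hand: tokenize the text, locate the mask position, and rank the vocabulary by the model's scores at that position. The sketch below illustrates this under a few assumptions. `top_k_from_scores` and `predict_mask` are hypothetical helper names, PyTorch is assumed to be installed, and the input is assumed to contain exactly one mask token.

```python
import math


def top_k_from_scores(scores: list[float], k: int) -> list[tuple[int, float]]:
    """Return the k highest-scoring indices with their softmax probabilities."""
    peak = max(scores)
    # Subtract the maximum before exponentiating for numerical stability.
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    return [(i, exps[i] / total) for i in ranked]


def predict_mask(
    text: str,
    model_name: str = "language-ml-lab/KurdBert",
    k: int = 5,
) -> list[tuple[str, float]]:
    """Predict the top-k tokens for the single mask token in `text`."""
    # Imported here so top_k_from_scores works without torch or transformers.
    import torch
    from transformers import AutoModelForMaskedLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForMaskedLM.from_pretrained(model_name)
    model.eval()

    inputs = tokenizer(text, return_tensors="pt")
    ids = inputs["input_ids"][0].tolist()
    positions = [i for i, t in enumerate(ids) if t == tokenizer.mask_token_id]
    if len(positions) != 1:
        raise ValueError("text must contain exactly one mask token")

    with torch.no_grad():
        logits = model(**inputs).logits  # shape: (batch, sequence, vocabulary)

    scores = logits[0, positions[0]].tolist()
    return [
        (tokenizer.convert_ids_to_tokens(idx), prob)
        for idx, prob in top_k_from_scores(scores, k)
    ]
```

The text passed to `predict_mask` should already contain the tokenizer's mask token (available as `tokenizer.mask_token`). Computing the softmax in plain Python keeps the ranking step easy to inspect; for batch use, `torch.softmax` and `torch.topk` would be the more efficient choice.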