---
language: en
tags:
- ner
- bert
- mountain-ner
- named-entity-recognition
license: mit
datasets:
- NERetrieve
- Few-NERD
- mountain-ner-dataset
metrics:
- accuracy
- f1
- precision
- recall
pipeline_tag: token-classification
model-index:
- name: mountain-ner-bert-base
  results:
  - task:
      type: token-classification
      name: Named Entity Recognition
    dataset:
      name: mountain-ner-dataset
      type: Gepe55o/mountain-ner-dataset
    metrics:
    - type: accuracy
      value: 0.9919
    - type: f1
      value: 0.9048
    - type: precision
      value: 0.8899
    - type: recall
      value: 0.9202
---

## Model Description

mountain-ner-bert-base is a BERT base model fine-tuned for named entity recognition of mountain names. It is trained on the [mountain-ner-dataset](https://huggingface.co/datasets/Gepe55o/mountain-ner-dataset), a merge of two datasets: [NERetrieve](https://arxiv.org/pdf/2310.14282) and [Few-NERD](https://arxiv.org/pdf/2105.07464v6).

The model distinguishes two token classes: `LABEL_0` (other) and `LABEL_1` (mountain name).

- Model Architecture: BERT base
- Task: mountain name entity recognition (token classification)
- Training Data: [mountain-ner-dataset](https://huggingface.co/datasets/Gepe55o/mountain-ner-dataset)

## Performance Metrics

| Epoch | Training Loss | Validation Loss | Accuracy | Precision | Recall | F1 |
|-------|---------------|-----------------|----------|-----------|--------|----|
| 1 | 0.027400 | 0.030793 | 0.988144 | 0.815692 | 0.924621 | 0.866748 |
| 2 | 0.020600 | 0.024568 | 0.991119 | 0.872988 | 0.921036 | 0.896369 |
| 3 | 0.012900 | 0.024072 | 0.991923 | 0.889878 | 0.920171 | 0.904771 |

The best checkpoint, from epoch 3, achieves:

- F1 Score: 0.9048
- Accuracy: 0.9919
- Precision: 0.8899
- Recall: 0.9202

## How to use

```python
from transformers import AutoModelForTokenClassification, AutoTokenizer, pipeline

# Load the model with its token-classification head
# (a bare AutoModel would drop the classifier that the NER pipeline needs)
model = AutoModelForTokenClassification.from_pretrained("Gepe55o/mountain-ner-bert-base")
tokenizer = AutoTokenizer.from_pretrained("Gepe55o/mountain-ner-bert-base")

nlp = pipeline("ner", model=model, tokenizer=tokenizer)

text = "Mount Everest is the highest mountain in the world."
result = nlp(text)
print(result)
```
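The raw `ner` pipeline output contains one entry per subword token, tagged `LABEL_0` or `LABEL_1`. To merge the pieces of a multi-token name such as "Mount Everest" into single spans, you can enable the pipeline's aggregation option. A minimal sketch (the example sentence is illustrative):

```python
from transformers import pipeline

# aggregation_strategy="simple" merges consecutive subword tokens that share
# a label, so "Mount" and "Everest" come back as one entity span.
nlp = pipeline(
    "ner",
    model="Gepe55o/mountain-ner-bert-base",
    aggregation_strategy="simple",
)

text = "The expedition crossed the Andes before climbing Mount Everest."
for entity in nlp(text):
    # Keep only the mountain-name spans; LABEL_0 groups are "other"
    if entity["entity_group"] == "LABEL_1":
        print(entity["word"], round(float(entity["score"]), 3))
```

With aggregation enabled, each result carries `entity_group`, `word`, `score`, `start`, and `end`, so the character offsets of every detected mountain mention can be recovered directly from the input text.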