tomaarsen
/

span-marker-bert-base-cross-ner

Token Classification

named-entity-recognition


tomaarsen (HF staff) committed on Aug 15, 2023

Commit af457e5 (1 parent: 97226a6)

Add training set info

Files changed (1)

README.md +1 -0


@@ -42,6 +42,7 @@ metrics:
 # SpanMarker for Named Entity Recognition
 This is a [SpanMarker](https://github.com/tomaarsen/SpanMarkerNER) model that can be used for Named Entity Recognition. In particular, this SpanMarker model uses [bert-base-cased](https://huggingface.co/bert-base-cased) as the underlying encoder. See [train.py](train.py) for the training script.
+It is trained on [P3ps/Cross_ner](https://huggingface.co/datasets/P3ps/Cross_ner), which I believe is a variant of [DFKI-SLT/cross_ner](https://huggingface.co/datasets/DFKI-SLT/cross_ner) that merged the validation set into the training set and applied deduplication.
 Is your data not (always) capitalized correctly? Then consider using the uncased variant of this model instead for better performance:
 [tomaarsen/span-marker-bert-base-uncased-cross-ner](https://huggingface.co/tomaarsen/span-marker-bert-base-uncased-cross-ner).
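
For readers unfamiliar with the preprocessing the added line describes, here is a minimal sketch of folding a validation split into a training split and dropping exact duplicates. It is an illustration only: the sentences are made up, the field names `tokens` and `ner_tags` and the helper `merge_and_deduplicate` are hypothetical, and this commit does not document how P3ps/Cross_ner was actually built.

```python
# Toy illustration only: the sentences below are invented, and the real
# procedure behind P3ps/Cross_ner is not documented in this commit.

def merge_and_deduplicate(train, validation):
    """Append validation examples to train, dropping exact duplicates.

    Each example is a dict with "tokens" and "ner_tags" lists (hypothetical
    field names). The first occurrence of each example is kept, in order.
    """
    seen = set()
    merged = []
    for example in list(train) + list(validation):
        # Lists are not hashable, so build a tuple key from both fields.
        key = (tuple(example["tokens"]), tuple(example["ner_tags"]))
        if key not in seen:
            seen.add(key)
            merged.append(example)
    return merged


train = [
    {"tokens": ["Paris", "is", "nice"], "ner_tags": [1, 0, 0]},
    {"tokens": ["Hello", "world"], "ner_tags": [0, 0]},
]
validation = [
    {"tokens": ["Hello", "world"], "ner_tags": [0, 0]},  # duplicate of a train row
    {"tokens": ["Amsterdam", "rocks"], "ner_tags": [1, 0]},
]

merged = merge_and_deduplicate(train, validation)
print(len(merged))
```

One consequence worth keeping in mind: if validation data was merged into training this way, scores reported on the original validation split would not be a clean held-out measurement.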