Create README.md
README.md
ADDED
@@ -0,0 +1,19 @@
+---
+language:
+- en
+pipeline_tag: token-classification
+---
+
+Named Entity Recognition (NER) model to recognize disease entities.
+
+[PubMedBERT](https://arxiv.org/pdf/2007.15779.pdf) fine-tuned on the following datasets:
+
+- [NCBI Disease Corpus](https://www.ncbi.nlm.nih.gov/research/bionlp/Data/disease/) (train and dev sets)
+- [PHAEDRA](http://www.nactem.ac.uk/PHAEDRA/) (train, dev, test sets): entity type "Disorder"
+- [Corpus for Disease Names and Adverse Effects](https://www.scai.fraunhofer.de/en/business-research-areas/bioinformatics/downloads/corpus-for-disease-names-and-adverse-effects.html) (train, dev, test sets): entity types "DISEASE", "ADVERSE"
+- [RareDis corpus](https://github.com/isegura/NLP4RARE-CM-UC3M/tree/main/corpus) (train, dev, test sets): entity types "DISEASE", "RAREDISEASE", "SYMPTOM"
+- [CoMAGC](https://github.com/isegura/NLP4RARE-CM-UC3M/tree/main/corpus) (train, dev, test sets): entity type "cancer_term"
+- [PGxCorpus](https://www.nature.com/articles/s41597-019-0342-9) (train, dev, test sets):
+- [miRNA-Test-Corpus](https://www.scai.fraunhofer.de/en/business-research-areas/bioinformatics/downloads/download-mirna-test-corpus.html) (train, dev, test sets): entity type "Diseases"
+- [BC5CDR]() (train and dev sets): entity type "Disease"
+- [Mantra](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986661/pdf/ocv037.pdf) (train, dev, test sets): entity type "DISO"
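
The datasets listed above annotate disease mentions under different entity type names. The README does not say how these were combined for fine-tuning, so the following is only a minimal sketch of one plausible preprocessing step: collapsing the quoted entity types into a single label and emitting BIO tags. The target label `Disease`, the helper names `unify_label` and `to_bio`, and the span format are all assumptions made for illustration. NCBI Disease Corpus and PGxCorpus are left out of the mapping because the README gives no entity type for them.

```python
from typing import List, Optional, Tuple

# Entity types quoted in the dataset list, keyed by corpus name.
# Treating all of them as one class is an assumption of this sketch,
# not something the README states.
DISEASE_TYPES = {
    "PHAEDRA": {"Disorder"},
    "Corpus for Disease Names and Adverse Effects": {"DISEASE", "ADVERSE"},
    "RareDis corpus": {"DISEASE", "RAREDISEASE", "SYMPTOM"},
    "CoMAGC": {"cancer_term"},
    "miRNA-Test-Corpus": {"Diseases"},
    "BC5CDR": {"Disease"},
    "Mantra": {"DISO"},
}

UNIFIED_LABEL = "Disease"  # hypothetical target label


def unify_label(corpus: str, entity_type: str) -> Optional[str]:
    """Return the unified label if entity_type is listed for corpus, else None."""
    if entity_type in DISEASE_TYPES.get(corpus, set()):
        return UNIFIED_LABEL
    return None


def to_bio(
    tokens: List[str],
    spans: List[Tuple[int, int, str]],
    corpus: str,
) -> List[str]:
    """Convert (start, end_exclusive, entity_type) token spans to BIO tags."""
    tags = ["O"] * len(tokens)
    for start, end, entity_type in spans:
        label = unify_label(corpus, entity_type)
        if label is None:
            continue  # skip entity types outside the listed disease types
        tags[start] = f"B-{label}"
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"
    return tags


# Usage: a made-up sentence with one listed type and one unlisted type.
tokens = ["Patients", "with", "breast", "cancer", "received", "tamoxifen"]
spans = [(2, 4, "cancer_term"), (5, 6, "drug")]
print(to_bio(tokens, spans, "CoMAGC"))
# → ['O', 'O', 'B-Disease', 'I-Disease', 'O', 'O']
```

Keeping the mapping keyed by corpus, rather than as one flat set of type names, avoids accidentally accepting a type name that means something different in another corpus.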