---
language:
- en
pipeline_tag: token-classification
license: apache-2.0
---

Named Entity Recognition (NER) model that recognizes disease entities.
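As a minimal usage sketch, the model can presumably be loaded through the `transformers` token-classification pipeline, in line with the `pipeline_tag` above. The repository ID below is a hypothetical placeholder (the actual ID is not stated in this card), and `load_tagger`, `extract_mentions`, and the `min_score` threshold are illustrative names, not part of the released model.

```python
from typing import Callable, Dict, List

# Hypothetical placeholder: replace with this model's actual Hub repository ID.
MODEL_ID = "your-namespace/your-disease-ner-model"


def load_tagger(model_id: str = MODEL_ID):
    """Build a token-classification pipeline (downloads the model on first use)."""
    # Imported lazily so that extract_mentions can be used without transformers.
    from transformers import pipeline

    # aggregation_strategy="simple" groups word pieces into whole entity spans.
    return pipeline(
        "token-classification",
        model=model_id,
        aggregation_strategy="simple",
    )


def extract_mentions(
    tagger: Callable[[str], List[Dict]],
    text: str,
    min_score: float = 0.5,  # assumed threshold, tune as needed
) -> List[Dict]:
    """Return the surface text, offsets, and label of each confident prediction."""
    mentions = []
    for ent in tagger(text):
        if ent["score"] >= min_score:
            mentions.append(
                {
                    "text": text[ent["start"]:ent["end"]],
                    "start": ent["start"],
                    "end": ent["end"],
                    "label": ent["entity_group"],
                }
            )
    return mentions
```

With the real repository ID filled in, `extract_mentions(load_tagger(), "The patient was diagnosed with cystic fibrosis.")` would return one dictionary per predicted disease mention. The label strings depend on the model's own configuration and are passed through unchanged.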

Please cite our work:

```
@article{NILNKER2022,
  title = {NILINKER: Attention-based approach to NIL Entity Linking},
  journal = {Journal of Biomedical Informatics},
  volume = {132},
  pages = {104137},
  year = {2022},
  issn = {1532-0464},
  doi = {10.1016/j.jbi.2022.104137},
  url = {https://www.sciencedirect.com/science/article/pii/S1532046422001526},
  author = {Pedro Ruas and Francisco M. Couto},
}
```

The model is [PubMedBERT](https://huggingface.co/microsoft/BiomedNLP-PubMedBERT-base-uncased-abstract-fulltext) fine-tuned on the following datasets:

- [NCBI Disease Corpus](https://www.ncbi.nlm.nih.gov/research/bionlp/Data/disease/) (train and dev sets)
- [PHAEDRA](http://www.nactem.ac.uk/PHAEDRA/) (train, dev, test sets): entity type "Disorder"
- [Corpus for Disease Names and Adverse Effects](https://www.scai.fraunhofer.de/en/business-research-areas/bioinformatics/downloads/corpus-for-disease-names-and-adverse-effects.html) (train, dev, test sets): entity types "DISEASE", "ADVERSE"
- [RareDis corpus](https://github.com/isegura/NLP4RARE-CM-UC3M/tree/main/corpus) (train, dev, test sets): entity types "DISEASE", "RAREDISEASE", "SYMPTOM"
- [CoMAGC](https://github.com/isegura/NLP4RARE-CM-UC3M/tree/main/corpus) (train, dev, test sets): entity type "cancer_term"
- [PGxCorpus](https://www.nature.com/articles/s41597-019-0342-9) (train, dev, test sets):
- [miRNA-Test-Corpus](https://www.scai.fraunhofer.de/en/business-research-areas/bioinformatics/downloads/download-mirna-test-corpus.html) (train, dev, test sets): entity type "Diseases"
- BC5CDR (train and dev sets): entity type "Disease"
- [Mantra](https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4986661/pdf/ocv037.pdf) (train, dev, test sets): entity type "DISO"
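
Because each corpus names its disease-like entities differently, the annotations have to be brought to a common tag set before joint fine-tuning. The card does not say how this was done, so the following is only a sketch under stated assumptions: the corpora are available as BIO tag sequences, every listed entity type is collapsed into a single `Disease` label, and all other types become `O`. The `harmonise` helper and the `Disease` target label are hypothetical. The entity types are copied from the list above; corpora for which no type is listed are left out.

```python
from typing import Dict, List, Set

# Entity types kept per corpus, copied from the dataset list above.
# Corpora with no entity type listed in the card are intentionally omitted.
DISEASE_TYPES: Dict[str, Set[str]] = {
    "PHAEDRA": {"Disorder"},
    "Corpus for Disease Names and Adverse Effects": {"DISEASE", "ADVERSE"},
    "RareDis corpus": {"DISEASE", "RAREDISEASE", "SYMPTOM"},
    "CoMAGC": {"cancer_term"},
    "miRNA-Test-Corpus": {"Diseases"},
    "BC5CDR": {"Disease"},
    "Mantra": {"DISO"},
}


def harmonise(corpus: str, tags: List[str], target: str = "Disease") -> List[str]:
    """Map corpus-specific BIO tags to one target label (assumed scheme)."""
    keep = DISEASE_TYPES[corpus]
    harmonised = []
    for tag in tags:
        # Split "B-RAREDISEASE" into prefix "B" and entity type "RAREDISEASE".
        prefix, _, entity_type = tag.partition("-")
        if prefix in ("B", "I") and entity_type in keep:
            harmonised.append(f"{prefix}-{target}")
        else:
            # Outside tokens and entity types that are not kept become "O".
            harmonised.append("O")
    return harmonised
```

For example, `harmonise("Mantra", ["B-DISO", "I-DISO", "O"])` yields `["B-Disease", "I-Disease", "O"]`.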