Weakly Supervised Multi-lingual NER pipeline
Collection
This repository contains the Named Entity Recognition (NER) pipeline for symptom extraction, focusing on multilingual capabilities.
•
16 items
•
Updated
This model had been created as part of joint research of HUMADEX research group (https://www.linkedin.com/company/101563689/) and has received funding by the European Union Horizon Europe Research and Innovation Program project SMILE (grant number 101080923) and Marie Skłodowska-Curie Actions (MSCA) Doctoral Networks, project BosomShield ((rant number 101073222). Responsibility for the information and views expressed herein lies entirely with the authors. Authors: dr. Izidor Mlakar, Rigon Sallauka, dr. Umut Arioz, dr. Matej Rojc
PROBLEM
: Diseases, symptoms, and medical conditions.TEST
: Diagnostic procedures and laboratory tests.TREATMENT
: Medications, therapies, and other medical interventions.Visit HUMADEX/Weekly-Supervised-NER-pipline for more info.
You can easily use this model with the Hugging Face transformers
library. Here's an example of how to load and use the model for inference:
from transformers import AutoTokenizer, AutoModelForTokenClassification
model_name = "HUMADEX/german_medical_ner"
# Load the tokenizer and model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(model_name)
# Sample text for inference
text = "Der Patient klagte über starke Kopfschmerzen und Übelkeit, die seit zwei Tagen anhielten. Zur Linderung der Symptome wurde ihm Paracetamol verschrieben, und er wurde angewiesen, sich auszuruhen und viel Flüssigkeit zu trinken."
# Tokenize the input text
inputs = tokenizer(text, return_tensors="pt")
Base model
google-bert/bert-base-cased