DT4H_XLM-R_mtl_es-it-ro_symptom

Model Description

This multilingual clinical Named Entity Recognition (NER) model is designed to identify symptom mentions in biomedical and clinical text. It is based on xlm-roberta-base and fine-tuned on translated variants of the clinical NER datasets SympTEMIST and CardioCCC, which consist of clinical case reports with manually annotated mentions of signs and symptoms, following a multi-task learning (MTL) approach and using the BIO tagging scheme for sequence labeling.

The model consists of a shared multilingual encoder and a set of language-specific token classification heads, each one being responsible for a different task. In this configuration, each classification head is trained on language-specific data manually annotated with symptom mentions.

  • Architecture: Multi-task learning (MTL)
  • Training setup: Multilingual, Monolabel (SYMPTOM)
  • Tasks: symptom_es_ner, symptom_it_ner, symptom_ro_ner
  • Supported languages:
    • Spanish (es)
    • Italian (it)
    • Romanian (ro)
  • Base model: xlm-roberta-base
  • Task: Token classification (NER)
  • Label scheme: BIO

Training Data

The model is trained on multilingual clinical NER data from the SympTEMIST and CardioCCC datasets across the supported languages. In this MTL set up, each classification head is trained to perform a specific task, which is the identification of symptom mentions in clinical and biomedical texts written in its assigned language. For that, each classification head is fine-tuned on language-specific data in which symptom mentions have been manually labeled. During training, the shared part of the model learns general medical and linguistic patterns across all languages while each classification head is fine-tuned on language-specific annotated data, so it becomes better at recognizing symptom names in that particular language.

The training data is provided as part of the MultiClinNER subtask of the MultiClinAI shared task, an initiative as part of the DataTools4Heart (DT4H) project, which provides translated and annotation-projected clinical corpora.

Training and test splits correspond to the MultiClinNER task at the 11th SMM4H-HeaRD Workshop (ACL 2026).

Model loading

This model uses a custom MTL architecture, and therefore cannot be loaded with:

AutoModelForTokenClassification.from_pretrained(...)

Instead, this repository provides a PyTorch checkpoint (.pt) which includes:

  • Encoder weights
  • All language-specific heads

How to use

To use the model:

  1. Download the .pt file
  2. Load it using the custom architecture

To facilitate this process, we provide an inference script in a GitHub repository that:

  • Loads the model from the checkpoint using the custom architecture
  • Processes .txt files from an input directory
  • Identifies symptom mentions in the language handled by the selected classification head (task)
  • Exports predictions as a TSV file in the format required for the MultiClinAI evaluation library:
filename                               label       start_span     end_span     text
MultiClinNER-it-test-symptom-59817     SYMPTOM     4017           4036         dolenzia addominale

Note: We recommend pre-tokenizing the input text into words, as this matches the training setup. Providing raw text directly may lead to slightly degraded performance.

Limitations and bias

At the time of submission, no formal bias or fairness evaluation has been conducted. We intend to conduct research in these areas in the future, and if completed, this model card will be updated.

Evaluation

Evaluation was conducted using strict (exact match) and character-level metrics on the MultiClinNER test set.

Language Strict P Strict R Strict F1 Char P Char R Char F1
es 0.5984 0.5780 0.5880 0.7581 0.7283 0.7429
it 0.6213 0.5205 0.5664 0.7715 0.6419 0.7008
ro 0.6432 0.5847 0.6126 0.7731 0.7004 0.7349
Average 0.5890 0.7262

Additional information

Authors

NLP4BIA team at the Barcelona Supercomputing Center (nlp4bia@bsc.es).

Contact information

judith.rosell [at] bsc.es

Funding

This model is part of the DataTools4Heart project, funded by the European Union’s Horizon Europe Framework Under Grant Agreement No. 101057849.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for BSC-NLP4BIA/DT4H_XLM-R_mtl_es-it-ro_symptom

Finetuned
(3983)
this model

Collection including BSC-NLP4BIA/DT4H_XLM-R_mtl_es-it-ro_symptom