Transformers
Safetensors
Polish
t5
text2text-generation
seq2seq
text-to-text
scientific-language-models
cross-lingual-transfer
wechsel
global-mmlu
text-generation-inference
Instructions to use rausch/pl-t5-base-sci-cp-15k with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use rausch/pl-t5-base-sci-cp-15k with Transformers:
# Load model directly from transformers import AutoTokenizer, AutoModelForSeq2SeqLM tokenizer = AutoTokenizer.from_pretrained("rausch/pl-t5-base-sci-cp-15k") model = AutoModelForSeq2SeqLM.from_pretrained("rausch/pl-t5-base-sci-cp-15k") - Notebooks
- Google Colab
- Kaggle
File size: 1,690 Bytes
0449f98 | 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 | ---
language:
- pl
base_model:
- allegro/plt5-base
datasets:
- scilons/SciLaD-all-text-v1
library_name: transformers
tags:
- t5
- seq2seq
- text-to-text
- scientific-language-models
- cross-lingual-transfer
- wechsel
- global-mmlu
---
# PL-Base-CP
Polish monolingual base model continued on the SciLaD target-language split as a 15k-step control baseline.
## Model Details
This is a monolingual continued-pretraining control checkpoint reported in the paper table. It is provided for reproducibility of the baseline comparison.
- Paper name: `PL-Base-CP`
- Model role: `baseline-control`
- Source/base model: [allegro/plt5-base](https://huggingface.co/allegro/plt5-base)
- Code and pipeline: [GitHub repository](https://github.com/nikolas-rauscher/scientific-english-crosslingual-transfer)
- Architecture: T5 encoder-decoder
- SciLaD dataset: [scilons/SciLaD-all-text-v1](https://huggingface.co/datasets/scilons/SciLaD-all-text-v1)
- Evaluation benchmark: [Global-MMLU](https://huggingface.co/datasets/CohereLabs/Global-MMLU)
## Evaluation
Zero-shot Global-MMLU accuracy reported by the paper aggregation:
| Metric | Accuracy |
|---|---:|
| Average | 24.65 |
| STEM | 23.88 |
| Humanities | 24.51 |
| Social Sciences | 23.43 |
| Other | 26.87 |
## Limitations
The model is evaluated primarily with zero-shot Global-MMLU. Downstream task-specific
evaluation is recommended before deployment in specialized scientific workflows.
## Citation
- Title: Transferring Scientific English Pre-Trained Language Models to Multiple Languages Using Cross-Lingual Transfer
- Authors: Nikolas Rauscher, Fabio Barth, Georg Rehm
- Venue: LREC-COLING 2026, citation details TBA after publication
|