File size: 1,690 Bytes
0449f98
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
---
language:
- pl
base_model:
- allegro/plt5-base
datasets:
- scilons/SciLaD-all-text-v1
library_name: transformers
tags:
- t5
- seq2seq
- text-to-text
- scientific-language-models
- cross-lingual-transfer
- wechsel
- global-mmlu
---

# PL-Base-CP

Polish monolingual base model continued on the SciLaD target-language split as a 15k-step control baseline.

## Model Details

This is a monolingual continued-pretraining control checkpoint reported in the paper table. It is provided for reproducibility of the baseline comparison.

- Paper name: `PL-Base-CP`
- Model role: `baseline-control`
- Source/base model: [allegro/plt5-base](https://huggingface.co/allegro/plt5-base)
- Code and pipeline: [GitHub repository](https://github.com/nikolas-rauscher/scientific-english-crosslingual-transfer)
- Architecture: T5 encoder-decoder
- SciLaD dataset: [scilons/SciLaD-all-text-v1](https://huggingface.co/datasets/scilons/SciLaD-all-text-v1)
- Evaluation benchmark: [Global-MMLU](https://huggingface.co/datasets/CohereLabs/Global-MMLU)

## Evaluation

Zero-shot Global-MMLU accuracy reported by the paper aggregation:

| Metric | Accuracy |
|---|---:|
| Average | 24.65 |
| STEM | 23.88 |
| Humanities | 24.51 |
| Social Sciences | 23.43 |
| Other | 26.87 |

## Limitations

The model is evaluated primarily with zero-shot Global-MMLU. Downstream task-specific
evaluation is recommended before deployment in specialized scientific workflows.

## Citation

- Title: Transferring Scientific English Pre-Trained Language Models to Multiple Languages Using Cross-Lingual Transfer
- Authors: Nikolas Rauscher, Fabio Barth, Georg Rehm
- Venue: LREC-COLING 2026, citation details TBA after publication