cjvt
/

matejklemen's picture
Add multilingual to the language tag (#1)
d9fedb5
|
raw
history blame
703 Bytes
---
language:
- sl
- hr
- en
- multilingual
license: cc-by-4.0
---
# crosloengual-bert-si-nli
CroSloEngual BERT model finetuned on the SI-NLI dataset for Slovene natural language inference.
Fine-tuned in a classic sequence pair classification setting on the official training/validation/test split for 10 epochs, using validation set accuracy for model selection.
Optimized using the AdamW optimizer (learning rate 2e-5) and cross-entropy loss.
Using batch size `82` (selected based on the available GPU memory) and maximum sequence length `107` (99th percentile of the lengths in the training set).
Achieves the following metrics:
- best validation accuracy: `0.660`
- test accuracy = `0.673`