A collection of resources for evaluation of LLM capabilities in the Estonian language.
AI & ML interests
Natural Language Processing
Papers
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation
GliLem: Leveraging GliNER for Contextualized Lemmatization in Estonian
Organization Card
We are the natural language processing research group at the Institute of Computer Science, University of Tartu. Our areas of focus include machine translation, speech synthesis, and NLP for Estonian, among others.
Llama-2-based LLMs fine-tuned for grammatical error correction. This collection also includes AEG (Artificial Error Generation) models.
To Err Is Human, but Llamas Can Learn It Too • Paper • 2403.05493 • 6
tartuNLP/Llamma-2-7b-ukr-p1-llama-errors-p2-GEC • Text Generation • 7B • 5 • 2
tartuNLP/Llammas-base-p1-llama-errors-p2-GEC • Text Generation • 7B • 17 • 3
tartuNLP/leo-hessianai-7b-p1-llama-errors-p2-GEC • Text Generation • 7B • 8
Models (65)
tartuNLP/smugri4-mt • Text Generation • 3B • 23
tartuNLP/llama-estllm-prototype-0825 • Text Generation • 8B • 342 • 2
tartuNLP/EstRoBERTa • Feature Extraction • 0.3B • 89
tartuNLP/Llammas • Text Generation • 7B • 228 • 7
tartuNLP/EstBERT_NER_v2 • Token Classification • 0.1B • 194 • 1
tartuNLP/EstBERT_NER • Token Classification • 0.1B • 222
tartuNLP/EstBERT • Fill-Mask • 0.1B • 404 • 4
tartuNLP/Qwen2.5-3B-Instruct-hsb-dsb • Text Generation • 3B • 5
tartuNLP/mmBERT-small-m-edu-classifier • Text Classification • 0.1B • 27 • 1
tartuNLP/XTTS-v2-est • 13
Datasets (29)
tartuNLP/winogrande_et • 6.59k • 232
tartuNLP/EstCOPA • 2k • 111 • 2
tartuNLP/EstNER • 46.1k • 110
tartuNLP/ifeval_et • 541 • 54
tartuNLP/Estonian_Subjectivity • 1k • 46
tartuNLP/finepdfs-et • 554k • 331
tartuNLP/lumiopen-hpltv2-llama33-edu-annotation-et • 500k • 140
tartuNLP/fineweb-c-combined • 509k • 73
tartuNLP/fineweb-c-combined-resample • 46.8k • 43
tartuNLP/word_meanings_et_multiple_choice • 997 • 40