Collection of Romanian models based on Llama2
OpenLLM-Ro
community
AI & ML interests
None defined yet.
Recent Activity
Organization Card
The goal of the OpenLLM-Ro is to bring together the Romanian community that builds open Romanian models and to collect these models in a single place.
We value:
- using public and open corpora
- open-source training and evaluation code.
In this organization, you can find RoLLM models, based on different underlying models and in different flavours (i.e., foundational, instruct, or chat variants):
- RoLlama2: Romanian models based on Llama2
- RoMistral: Romanian models based on Mistral
- RoLlama3: Romanian models based on Llama3
- RoLlama3.1: Romanian models based on Llama3.1
- RoGemma: Romanian models based on Gemma
- RoGemma2: Romanian models based on Gemma2
- RoLlava: Romanian models based on Llava
- RoQwen2-VL: Romanian models based on Qwen2-VL
- RoQwen2.5-VL: Romanian models based on Qwen2.5-VL
- RoQwen3-VL: Romanian models based on Qwen3-VL
- RoGemma3: Romanian models based on Gemma3
Furthermore, here you can find data used to train and evaluate LLMs & VLMs in Romanian. Currently, there are four data collections:
- Pretraining dataset: Romanian pretraining quality-filtered data
- SFT datasets: data used for supervised (instruction) finetuning
- Alignment datasets: data used mainly for Direct Preference Optimization (DPO)
- Evaluation datasets: data used for evaluating models in Romanian
See details in https://arxiv.org/abs/2406.18266 and https://arxiv.org/abs/2605.31401.
- 23-04-2025: we increased the datasets used for supervised finetuning with high-quality data generated using Magpie (RoMagpie-Reasoning and RoMagpie-Pro-MT), and greatly increase the size of the alignment dataset by adding high-quality datasets (RoUltraFeedback, RoMagpie-DPO, RoArgillaMagpieUltra and RoHelpSteer2)
- 28-11-2025: we release augmented pretraining data and quality classifier
- 05-06-2026: we release RoVLMs together with training and evaluation data
- We encourage the community to engage in discussions (to provide feedback, ask questions, or make improvement suggestions) in Hugging Face or GitHub.
We will also organize physical meetings (announced in advance) to brainstorm ideas, roadmap, and other technical aspects.
Extra info: check also the work by the Faur AI team
models 54
OpenLLM-Ro/RoGemma3-4B-Instruct
Image-Text-to-Text • 5B • Updated • 2
OpenLLM-Ro/RoQwen3-VL-2B-Instruct
Image-Text-to-Text • 2B • Updated • 2
OpenLLM-Ro/RoQwen2.5-VL-3B-Instruct
Image-Text-to-Text • 4B • Updated • 2
OpenLLM-Ro/RoQwen2-VL-2B-Instruct
Image-Text-to-Text • 2B • Updated • 2
OpenLLM-Ro/RoLlava-Next-Llama3-8B-Instruct
Image-Text-to-Text • 603k • Updated
OpenLLM-Ro/RoGemma-7b-Instruct-DPO-2024-10-09
9B • Updated • 22
OpenLLM-Ro/RoGemma-7b-Instruct-2024-06-28
Text Generation • 9B • Updated • 28 • 1
OpenLLM-Ro/RoGemma-7b-Instruct-2024-10-09
9B • Updated • 27
OpenLLM-Ro/RoGemma-7b-Instruct-2025-04-23
9B • Updated • 42
OpenLLM-Ro/RoGemma-7b-Instruct-DPO
9B • Updated • 27 • 1
datasets 54
OpenLLM-Ro/ro_sft_finepdfs
Viewer • Updated • 379k
OpenLLM-Ro/ro_sft_cosyn
Viewer • Updated • 427k • 10
OpenLLM-Ro/ro_sft_flickr30k_qa
Viewer • Updated • 25.4k • 9
OpenLLM-Ro/ro_sft_flickr30k_cap
Viewer • Updated • 25.4k • 13
OpenLLM-Ro/ro_sft_pixmo_cap
Viewer • Updated • 608k • 28
OpenLLM-Ro/ro_sft_pixmo_points
Viewer • Updated • 188k • 45
OpenLLM-Ro/ro_sft_pixmo_count
Viewer • Updated • 34.6k • 22 • 1
OpenLLM-Ro/ro_sft_pixmo_cap_qa
Viewer • Updated • 207k • 30
OpenLLM-Ro/ro_sft_pixmo_aa
Viewer • Updated • 133k • 58
OpenLLM-Ro/ro_sft_llava_mix
Viewer • Updated • 625k • 86 • 1