Aaron Chibb

aari1995

AI & ML interests

Multilinguality and German LLMs

Recent Activity

liked a dataset 12 days ago
gretelai/gretel-pii-masking-en-v1
liked a model 17 days ago
UKPLab/triple-encoders-dailydialog
liked a model 27 days ago
ibm-granite/granite-3.0-8b-instruct

Organizations

Posts 3

view post
Post
3227
ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

mLLM - first release:
orca_dpo_pairs by Intel (translated into 7 languages)

ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

Upcoming:
- more datasets
- cleaning steps
- a blogpost
- stay updated at https://hf.co/multilingual

multilingual/orca_dpo_pairs
view post
Post
looking at the tokenizer and the naming (β€œ_enβ€œ), Google Gemma is very likely to have a multilingual counterpart. πŸ‘€

Thoughts?