PleIAs

AI & ML interests

Open Science LLMs

Organization Card

PleIAs is a private French AI lab training the next generation of language models for document processing.

PleIAs is committed to open science and has coordinated the release of some of the largest open corpora for pre-training.

For more information, visit our website: https://pleias.fr/

Collections 11

Spaces 7

baguettotron_demo

Vintage OCR Corrector (GPU)

Correct OCR errors in your text

Vintage OCR Corrector (CPU)

Correct OCR errors in text

Finance Commons Explorer

Browse finance datasets on Hugging Face

Reversed-Zotero

Models 30

Datasets 59

PleIAs/telecom-knowledge-base

Viewer • Updated 19 days ago • 4.68M • 10

PleIAs/SYNTH

Viewer • Updated 25 days ago • 68M • 22.8k • 266

PleIAs/common_corpus

Viewer • Updated 26 days ago • 69.9k • 152k • 399

PleIAs/CommonLingua-Train

Viewer • Updated Apr 28 • 2.76M • 261 • 15

PleIAs/French-Science-Commons

Viewer • Updated Mar 19 • 42.6M • 8.73k • 20

PleIAs/BSF_Redline

Viewer • Updated Feb 27 • 1.05M • 292

PleIAs/Japanese-PD

Viewer • Updated Feb 16 • 1.38M • 885 • 1

PleIAs/Arabic-PD

Viewer • Updated Feb 16 • 221k • 130

PleIAs/verse-wikisource

Preview • Updated Nov 11, 2025 • 35 • 3

PleIAs/Youtube-Commons-Audio-Sample-1000

Updated Oct 11, 2025 • 9
