File size: 701 Bytes
d08a3c7
 
 
 
 
 
 
 
 
1087fa8
 
4103d22
f24fcfb
4103d22
f24fcfb
4103d22
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
---
title: README
emoji: πŸŒ–
colorFrom: yellow
colorTo: pink
sdk: static
pinned: false
---

Find below the guidelines to push your datasets within the Open Catalogue of European Datasets. 

1️⃣ Learn how to push a dataset to the Hub πŸ‘‰ https://github.com/bigscience-workshop/data-preparation/tree/main/sourcing/Gathering%20Identified%20Datasets%20and%20Collections

2️⃣ Use the following tools & code to pre-process your datasets πŸ‘‰ https://github.com/bigscience-workshop/data-preparation/tree/main/preprocessing/training/01a_catalogue_cleaning_and_filtering

3️⃣ Push your processed dataset here using the following naming convention european-catalogue-data-processed-language-source