Spaces:
Running
Running
title: README | |
emoji: π | |
colorFrom: yellow | |
colorTo: pink | |
sdk: static | |
pinned: false | |
Find below the guidelines to push your datasets within the Open Catalogue of European Datasets. | |
1οΈβ£ Learn how to push a dataset to the Hub π https://github.com/bigscience-workshop/data-preparation/tree/main/sourcing/Gathering%20Identified%20Datasets%20and%20Collections | |
2οΈβ£ Use the following tools & code to pre-process your datasets π https://github.com/bigscience-workshop/data-preparation/tree/main/preprocessing/training/01a_catalogue_cleaning_and_filtering | |
3οΈβ£ Push your processed dataset here using the following naming convention european-catalogue-data-processed-language-source |