metadata
title: README
emoji: π
colorFrom: yellow
colorTo: pink
sdk: static
pinned: false
Follow the guidelines below to push your datasets to the Open Catalogue of European Datasets.
1️⃣ Learn how to push a dataset to the Hub: https://github.com/bigscience-workshop/data-preparation/tree/main/sourcing/Gathering%20Identified%20Datasets%20and%20Collections
2️⃣ Use the following tools & code to pre-process your datasets: https://github.com/bigscience-workshop/data-preparation/tree/main/preprocessing/training/01a_catalogue_cleaning_and_filtering
3️⃣ Push your processed dataset here using the following naming convention: `european-catalogue-data-processed-language-source`
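
As an illustration of the naming convention in step 3, here is a minimal sketch that builds a repository name from a language and a source. It assumes that `language` and `source` in the convention are placeholders to fill in, and that components should be lowercased and hyphen-separated; neither assumption is stated in the guidelines. The helper names `slugify` and `processed_repo_name` are hypothetical.

```python
import re

# Fixed prefix taken from the naming convention above.
PREFIX = "european-catalogue-data-processed"


def slugify(value: str) -> str:
    """Lowercase a value and collapse non-alphanumeric runs into hyphens.

    Assumption: the guidelines do not specify casing or separators.
    """
    slug = re.sub(r"[^a-z0-9]+", "-", value.strip().lower()).strip("-")
    if not slug:
        raise ValueError(f"cannot build a name component from {value!r}")
    return slug


def processed_repo_name(language: str, source: str) -> str:
    """Build a name of the form <prefix>-<language>-<source>."""
    return f"{PREFIX}-{slugify(language)}-{slugify(source)}"


print(processed_repo_name("fr", "Wikipedia"))
# prints "european-catalogue-data-processed-fr-wikipedia"
```

With the `datasets` library, the resulting name could then be passed to `Dataset.push_to_hub`, prefixed by the organization name, for example `dataset.push_to_hub(f"<organization>/{name}")`, where `<organization>` is a placeholder for this organization's Hub identifier.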