Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -9,6 +9,6 @@ pinned: false
|
|
9 |
|
10 |
Find below the guidelines to push your datasets within the Open Catalogue of European Datasets.
|
11 |
|
12 |
-
|
13 |
-
|
14 |
-
|
|
|
9 |
|
10 |
Find below the guidelines to push your datasets within the Open Catalogue of European Datasets.
|
11 |
|
12 |
+
1️⃣ Learn how to push a dataset to the Hub 👉 https://github.com/bigscience-workshop/data-preparation/tree/main/sourcing/Gathering%20Identified%20Datasets%20and%20Collections
|
13 |
+
2️⃣ Use the following tools & code to pre-process your datasets 👉 https://github.com/bigscience-workshop/data-preparation/tree/main/preprocessing/training/01a_catalogue_cleaning_and_filtering
|
14 |
+
3️⃣ Push your processed dataset here using the following naming convention european-catalogue-data-processed-language-source
|