Violette HF staff commited on
Commit
f24fcfb
β€’
1 Parent(s): 4103d22

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -10,5 +10,7 @@ pinned: false
10
  Find below the guidelines to push your datasets within the Open Catalogue of European Datasets.
11
 
12
  1️⃣ Learn how to push a dataset to the Hub πŸ‘‰ https://github.com/bigscience-workshop/data-preparation/tree/main/sourcing/Gathering%20Identified%20Datasets%20and%20Collections
 
13
  2️⃣ Use the following tools & code to pre-process your datasets πŸ‘‰ https://github.com/bigscience-workshop/data-preparation/tree/main/preprocessing/training/01a_catalogue_cleaning_and_filtering
 
14
  3️⃣ Push your processed dataset here using the following naming convention european-catalogue-data-processed-language-source
 
10
  Find below the guidelines to push your datasets within the Open Catalogue of European Datasets.
11
 
12
  1️⃣ Learn how to push a dataset to the Hub πŸ‘‰ https://github.com/bigscience-workshop/data-preparation/tree/main/sourcing/Gathering%20Identified%20Datasets%20and%20Collections
13
+
14
  2️⃣ Use the following tools & code to pre-process your datasets πŸ‘‰ https://github.com/bigscience-workshop/data-preparation/tree/main/preprocessing/training/01a_catalogue_cleaning_and_filtering
15
+
16
  3️⃣ Push your processed dataset here using the following naming convention european-catalogue-data-processed-language-source