Transformers
Inference Endpoints
loubnabnl HF staff commited on
Commit
6f9fdab
1 Parent(s): c3735a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -1,13 +1,13 @@
1
  ---
2
  library_name: transformers
3
- tags: []
 
4
  ---
5
 
6
- # Model Card for Model ID
7
  Tokenizer for the training of cosmo2. This tokenizer was trained on 1M samples from:
8
  - FineWeb-Edu 80%
9
  - Cosmopedia v2 15%
10
  - StarCoderData 8%
11
  - OpenWebMath 5%
12
- - StackOverFlow 2%
13
-
 
1
  ---
2
  library_name: transformers
3
+ datasets:
4
+ - HuggingFaceTB/cosmo2_training_data_subset_1M
5
  ---
6
 
7
+ # cosmo2-tokenizer
8
  Tokenizer for the training of cosmo2. This tokenizer was trained on 1M samples from:
9
  - FineWeb-Edu 80%
10
  - Cosmopedia v2 15%
11
  - StarCoderData 8%
12
  - OpenWebMath 5%
13
+ - StackOverFlow 2%