Shushant Pudasaini commited on
Commit
7756f55
1 Parent(s): 8d3e10b

data description

Browse files

data description added

Files changed (1) hide show
  1. README.md +4 -0
README.md CHANGED
@@ -12,3 +12,7 @@ from transformers import pipeline
12
  fill_mask = pipeline( "fill-mask", model=model, tokenizer=tokenizer, )
13
  from pprint import pprint pprint(fill_mask(f"तिमीलाई कस्तो {tokenizer.mask_token}."))
14
  ```
 
 
 
 
 
12
  fill_mask = pipeline( "fill-mask", model=model, tokenizer=tokenizer, )
13
  from pprint import pprint pprint(fill_mask(f"तिमीलाई कस्तो {tokenizer.mask_token}."))
14
  ```
15
+
16
+ ## Data Description
17
+ Trained on about 4.6 GB of Nepali text corpus collected from various sources
18
+ These data were collected from nepali news site, OSCAR nepali corpus