Shushant Pudasaini
commited on
Commit
•
7756f55
1
Parent(s):
8d3e10b
data description
Browse filesdata description added
README.md
CHANGED
@@ -12,3 +12,7 @@ from transformers import pipeline
|
|
12 |
fill_mask = pipeline( "fill-mask", model=model, tokenizer=tokenizer, )
|
13 |
from pprint import pprint pprint(fill_mask(f"तिमीलाई कस्तो {tokenizer.mask_token}."))
|
14 |
```
|
|
|
|
|
|
|
|
|
|
12 |
fill_mask = pipeline( "fill-mask", model=model, tokenizer=tokenizer, )
|
13 |
from pprint import pprint pprint(fill_mask(f"तिमीलाई कस्तो {tokenizer.mask_token}."))
|
14 |
```
|
15 |
+
|
16 |
+
## Data Description
|
17 |
+
Trained on about 4.6 GB of Nepali text corpus collected from various sources
|
18 |
+
These data were collected from nepali news site, OSCAR nepali corpus
|