Commit 0203ac2 • committed by hyunwoongko
1 Parent(s): 3794f05
Update README.md
README.md CHANGED
@@ -37,8 +37,8 @@ Polyglot-Ko-3.8B was trained on 863 GB of Korean language data (1.2TB before pro
 |-------------------------------------|---------|------------------------------------------|
 | Korean blog posts | 682.3 | - |
 | Korean news dataset | 87.0 | - |
-| Modu corpus |
-| Korean patent dataset |
+| Modu corpus | 26.4 |corpus.korean.go.kr |
+| Korean patent dataset | 19.0 | - |
 | Korean Q & A dataset | 18.1 | - |
 | KcBert dataset | 12.7 | github.com/Beomi/KcBERT |
 | Korean fiction dataset | 6.1 | - |