roberta-base-korean-morph-upos
Model Description
This is a RoBERTa model pre-trained on Korean texts for POS-tagging and dependency-parsing, derived from roberta-base-korean-hanja and morphUD-korean. Every morpheme (형태소) is tagged by UPOS(Universal Part-Of-Speech).
How to Use
from transformers import AutoTokenizer,AutoModelForTokenClassification,TokenClassificationPipeline
tokenizer=AutoTokenizer.from_pretrained("KoichiYasuoka/roberta-base-korean-morph-upos")
model=AutoModelForTokenClassification.from_pretrained("KoichiYasuoka/roberta-base-korean-morph-upos")
pipeline=TokenClassificationPipeline(tokenizer=tokenizer,model=model,aggregation_strategy="simple")
nlp=lambda x:[(x[t["start"]:t["end"]],t["entity_group"]) for t in pipeline(x)]
print(nlp("홍시 맛이 나서 홍시라 생각한다."))
or
import esupar
nlp=esupar.load("KoichiYasuoka/roberta-base-korean-morph-upos")
print(nlp("홍시 맛이 나서 홍시라 생각한다."))
See Also
esupar: Tokenizer POS-tagger and Dependency-parser with BERT/RoBERTa/DeBERTa models
- Downloads last month
- 25
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.
Model tree for KoichiYasuoka/roberta-base-korean-morph-upos
Base model
klue/roberta-base
Finetuned
KoichiYasuoka/roberta-base-korean-hanja