metadata
language:
  - id
tags:
  - punctuation prediction
  - punctuation
widget:
  - text: halo bagaimana kabarmu
    example_title: indonesian

This model predicts punctuation for Indonesian text. It was created to restore punctuation in text transcribed by speech recognition models. It is based on the work at https://github.com/oliverguhr/fullstop-deep-punctuation-prediction

The model restores the following punctuation markers: "." "," "?" "-" ":"

Install

To get started, install the package from PyPI:

pip install deepmultilingualpunctuation

Restore Punctuation

from deepmultilingualpunctuation import PunctuationModel

model = PunctuationModel("Rizkinoor16/fullstop-indonesian-punctuation-prediction")
text = "halo bagaimana kabarmu"
result = model.restore_punctuation(text)
print(result)
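
When per-word labels are more useful than the final string, the text can be rebuilt from (word, label, score) triples. The sketch below assumes the package's predict method returns triples in that shape, with "0" meaning "no punctuation", as described in the upstream project's README. join_labeled_words is a hypothetical helper, not part of the package, and the sample triples are hand-written for illustration rather than real model output.

```python
from typing import Iterable, Sequence

# Marks this model card lists as restorable. "0" is assumed to mean "no punctuation".
PUNCTUATION_LABELS = {".", ",", "?", "-", ":"}


def join_labeled_words(labeled_words: Iterable[Sequence]) -> str:
    """Rebuild a string from (word, label, score) triples.

    Hypothetical helper: attaches the predicted mark directly after its word
    and joins the words with single spaces.
    """
    parts = []
    for word, label, _score in labeled_words:
        parts.append(word + label if label in PUNCTUATION_LABELS else word)
    return " ".join(parts)


# Hand-written triples for illustration only (not real model output).
sample = [["halo", ",", 0.9], ["bagaimana", "0", 0.9], ["kabarmu", "?", 0.9]]
restored = join_labeled_words(sample)
print(restored)
```

With the real model, the triples would presumably come from model.predict(model.preprocess(text)). Treat that call chain as an assumption and check it against the installed package version.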

Results

              precision    recall  f1-score   support

           0       0.98      0.99      0.98  38057720
           .       0.89      0.91      0.90   2234980
           ,       0.84      0.79      0.81   3037655
           ?       0.84      0.79      0.82     72969
           -       0.96      0.90      0.93    162085
           :       0.91      0.89      0.90    191937

    accuracy                           0.97  43757346
   macro avg       0.90      0.88      0.89  43757346
weighted avg       0.97      0.97      0.97  43757346
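
The two summary rows follow from the per-class rows: the macro average weights every label equally, while the weighted average weights each label by its support. The sketch below recomputes both from the rounded figures in the report, so the last digit can differ slightly from the reported values. It also shows that label "0" (no punctuation) accounts for most tokens, which is why the weighted average sits close to that row.

```python
# Per-class rows copied from the report: label -> (precision, recall, f1, support)
rows = {
    "0": (0.98, 0.99, 0.98, 38057720),
    ".": (0.89, 0.91, 0.90, 2234980),
    ",": (0.84, 0.79, 0.81, 3037655),
    "?": (0.84, 0.79, 0.82, 72969),
    "-": (0.96, 0.90, 0.93, 162085),
    ":": (0.91, 0.89, 0.90, 191937),
}

total_support = sum(r[3] for r in rows.values())


def macro_avg(metric_index: int) -> float:
    """Unweighted mean of one metric across all labels."""
    return sum(r[metric_index] for r in rows.values()) / len(rows)


def weighted_avg(metric_index: int) -> float:
    """Mean of one metric across labels, weighted by each label's support."""
    return sum(r[metric_index] * r[3] for r in rows.values()) / total_support


macro = [macro_avg(i) for i in range(3)]        # precision, recall, f1
weighted = [weighted_avg(i) for i in range(3)]  # precision, recall, f1
no_punct_share = rows["0"][3] / total_support   # fraction of tokens labelled "0"

print("total support:", total_support)
print("macro avg:   ", [round(v, 3) for v in macro])
print("weighted avg:", [round(v, 3) for v in weighted])
print("share of '0':", round(no_punct_share, 3))
```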

Contact

Rizki Noor: rizki@cakra.ai
LinkedIn: Noor Muhamad Rizki