metadata
language:
  - ms
  - en
  - multilingual
license: apache-2.0
tags:
  - text-classification
  - sentiment-analysis
widget:
  - text: Saya sangat gembira hari ini!

bert-tiny-bahasa-cased-sentiment

A proof of concept for building a sentiment analysis model, using https://huggingface.co/malay-huggingface/bert-base-bahasa-cased as the base model.

The tokenizer is copied directly from https://huggingface.co/malay-huggingface/bert-base-bahasa-cased.

The model was fine-tuned for sentiment analysis on data compiled by huseinzol05 at https://github.com/huseinzol05/malay-dataset/tree/master/sentiment.
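
A minimal sketch of how the model might be loaded for inference with the `transformers` library. The repository ID below is an assumption inferred from the model name, and `load_classifier`, `best_prediction`, and `main` are hypothetical helper names used only for illustration. The label names the model returns are not documented here, so none are shown.

```python
from typing import Any, Dict, List

# Assumption: repository ID inferred from the model name in this card;
# adjust it if the model is published under a different namespace.
MODEL_ID = "patrickxchong/bert-tiny-bahasa-cased-sentiment"


def load_classifier(model_id: str = MODEL_ID):
    """Build a text-classification pipeline (downloads weights on first use)."""
    # Imported lazily so best_prediction() works without transformers installed.
    from transformers import pipeline

    return pipeline("text-classification", model=model_id, tokenizer=model_id)


def best_prediction(scores: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Return the highest-scoring {"label", "score"} entry from pipeline output."""
    return max(scores, key=lambda item: item["score"])


def main() -> None:
    classifier = load_classifier()
    # Widget sentence from the metadata ("I am very happy today!").
    text = "Saya sangat gembira hari ini!"
    # For a single string, the pipeline returns a list of label/score dicts.
    print(best_prediction(classifier(text)))
```

Calling `main()` requires `transformers` with a backend such as PyTorch installed, plus network access to download the weights on the first run.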