kogpt2-wellness / README.md
a2ran's picture
Update README.md
b16c3b1
---
license: cc-by-nc-sa-4.0
tags:
- generated_from_trainer
model-index:
- name: kogpt2-finetuned-chatbot
results: []
---
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->
# kogpt2-finetuned-chatbot
This model is a fine-tuned version of [skt/kogpt2-base-v2](https://huggingface.co/skt/kogpt2-base-v2) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 0.6311
- '<\unused1>' ํ† ํฐ์„ ๊ธฐ์ค€์œผ๋กœ ์งˆ๋ฌธ, ๋ฐœํ™” ๋‹ต๋ณ€์„ ๋‚˜๋ˆˆ ์‘๋‹ตํ˜• text generation pretrained ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.
### Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 3e-05
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 3.0
### Training results
| Training Loss | Epoch | Step | Validation Loss |
|:-------------:|:-----:|:-----:|:---------------:|
| 0.9794 | 1.0 | 4436 | 0.8402 |
| 0.7568 | 2.0 | 8872 | 0.6767 |
| 0.6748 | 3.0 | 13308 | 0.6311 |
### Framework versions
- Transformers 4.26.1
- Pytorch 1.13.1+cu116
- Tokenizers 0.13.2