Training completed

Browse files

Files changed (5) hide show

README.md +68 -0
generation_config.json +8 -0
model.safetensors +1 -1
runs/Jun10_17-37-01_f81d5a87c7e5/events.out.tfevents.1718041569.f81d5a87c7e5.34.0 +2 -2
runs/Jun10_17-37-01_f81d5a87c7e5/events.out.tfevents.1718067195.f81d5a87c7e5.34.1 +3 -0

README.md ADDED Viewed

	@@ -0,0 +1,68 @@

+---
+base_model: vinai/bartpho-syllable
+tags:
+- text2text-generation
+- generated_from_trainer
+metrics:
+- sacrebleu
+model-index:
+- name: vietnamese-correction-203
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# vietnamese-correction-203
+This model is a fine-tuned version of [vinai/bartpho-syllable](https://huggingface.co/vinai/bartpho-syllable) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.1838
+- Sacrebleu: 27.6927
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 4
+- eval_batch_size: 4
+- seed: 42
+- gradient_accumulation_steps: 8
+- total_train_batch_size: 32
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 5
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch  | Step  | Validation Loss | Sacrebleu |
+|:-------------:|:------:|:-----:|:---------------:|:---------:|
+| 0.3565        | 0.7529 | 2000  | 0.1943          | 26.5749   |
+| 0.1717        | 1.5059 | 4000  | 0.1713          | 27.1811   |
+| 0.116         | 2.2588 | 6000  | 0.1705          | 27.3551   |
+| 0.076         | 3.0118 | 8000  | 0.1582          | 27.5668   |
+| 0.0418        | 3.7647 | 10000 | 0.1674          | 27.5994   |
+| 0.0278        | 4.5176 | 12000 | 0.1839          | 27.6218   |
+### Framework versions
+- Transformers 4.41.2
+- Pytorch 2.1.2
+- Datasets 2.19.2
+- Tokenizers 0.19.1

generation_config.json ADDED Viewed

	@@ -0,0 +1,8 @@

+{
+  "bos_token_id": 0,
+  "decoder_start_token_id": 2,
+  "eos_token_id": 2,
+  "forced_eos_token_id": 2,
+  "pad_token_id": 1,
+  "transformers_version": "4.41.2"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b026f2a73b3bc56e622b6e017539985032cfc09a6ff5d0099ba66855542e49db
 size 1583480280

 version https://git-lfs.github.com/spec/v1
+oid sha256:30a9d2832e6f68496928d8b64eb40c448ed0e9a7377fb7c7ddd1ec02a3119a1e
 size 1583480280

runs/Jun10_17-37-01_f81d5a87c7e5/events.out.tfevents.1718041569.f81d5a87c7e5.34.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:40bb3d6fd3b5fa1326088154ec34b042fba213a081adbc508c23d5342742741a
-size 8798

 version https://git-lfs.github.com/spec/v1
+oid sha256:d4bac22baa4213efa279bde150f4cb709fb3b12d5be8823e2b8f5b4dbd319d5a
+size 9152

runs/Jun10_17-37-01_f81d5a87c7e5/events.out.tfevents.1718067195.f81d5a87c7e5.34.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:84c8c6184192e5ca74a53c090e719b4204addec605e9a7481e58f698c7cf743e
+size 412