End of training

Browse files

Files changed (4) hide show

README.md +71 -0
generation_config.json +13 -0
model.safetensors +1 -1
runs/Jan05_08-06-57_a15af13407bd/events.out.tfevents.1704442018.a15af13407bd.26.0 +2 -2

README.md ADDED Viewed

	@@ -0,0 +1,71 @@

+---
+base_model: Ransaka/sinhala-ocr-model
+tags:
+- generated_from_trainer
+model-index:
+- name: sinhala-ocr-model-v3
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# sinhala-ocr-model-v3
+This model is a fine-tuned version of [Ransaka/sinhala-ocr-model](https://huggingface.co/Ransaka/sinhala-ocr-model) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 4.7242
+- Cer: 0.2764
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 1e-05
+- train_batch_size: 8
+- eval_batch_size: 8
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 16
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- training_steps: 6000
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Cer    |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| 3.6711        | 6.54  | 500  | 4.9311          | 0.4178 |
+| 2.3499        | 13.07 | 1000 | 4.5366          | 0.3482 |
+| 1.5601        | 19.61 | 1500 | 4.4634          | 0.3204 |
+| 0.987         | 26.14 | 2000 | 4.4804          | 0.3011 |
+| 0.6487        | 32.68 | 2500 | 4.6310          | 0.2863 |
+| 0.3816        | 39.22 | 3000 | 4.6093          | 0.2788 |
+| 0.3494        | 45.75 | 3500 | 4.6291          | 0.2827 |
+| 0.2357        | 52.29 | 4000 | 4.6399          | 0.2780 |
+| 0.2188        | 58.82 | 4500 | 4.6313          | 0.2798 |
+| 0.1413        | 65.36 | 5000 | 4.6828          | 0.2768 |
+| 0.0985        | 71.9  | 5500 | 4.7135          | 0.2772 |
+| 0.1086        | 78.43 | 6000 | 4.7242          | 0.2764 |
+### Framework versions
+- Transformers 4.35.2
+- Pytorch 2.0.0
+- Datasets 2.16.0
+- Tokenizers 0.15.0

generation_config.json ADDED Viewed

	@@ -0,0 +1,13 @@

+{
+  "bos_token_id": 0,
+  "decoder_start_token_id": 2,
+  "early_stopping": true,
+  "eos_token_id": 3,
+  "length_penalty": 2.0,
+  "max_length": 64,
+  "no_repeat_ngram_size": 3,
+  "num_beams": 4,
+  "pad_token_id": 0,
+  "transformers_version": "4.35.2",
+  "use_cache": false
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:af9780b8655121ba34549bbace2d073b0494eaa9f1831a8cab9ac562276a45a5
 size 1260933520

 version https://git-lfs.github.com/spec/v1
+oid sha256:fa9f7341a4eb1f781de092c1537a2ca8c2fdff5462a91f9288cf8c6029b48bb4
 size 1260933520

runs/Jan05_08-06-57_a15af13407bd/events.out.tfevents.1704442018.a15af13407bd.26.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5d0adc504bf3ed1f94f52915c40de0f1591845127821846892210b63ee0639fe
-size 49969

 version https://git-lfs.github.com/spec/v1
+oid sha256:c1fcf4344d1c4876861427e970fee1d165e31c99bc85dc86d99c2e832e1ec442
+size 50323