pandyamarut committed
Commit 0fe279d · verified · 1 parent: 07360ae

End of training

Files changed (2)
  1. README.md +7 -7
  2. adapter_model.bin +1 -1
README.md CHANGED
@@ -58,7 +58,7 @@ optimizer: adamw_8bit
 output_dir: /runpod-volume/fine-tuning/test-run
 pad_to_sequence_len: true
 run_name: test-run
-runpod_job_id: b7693c20-f1ab-4572-ad4f-bf19fa790d82-u1
+runpod_job_id: dd327f42-5f67-4830-b512-4561fa9a3d45-u1
 sample_packing: true
 saves_per_epoch: 1
 sequence_len: 2048
@@ -82,7 +82,7 @@ weight_decay: 0
 
 This model is a fine-tuned version of [NousResearch/Llama-3.2-1B](https://huggingface.co/NousResearch/Llama-3.2-1B) on the teknium/GPT4-LLM-Cleaned dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1014
+- Loss: 1.1018
 
 ## Model description
 
@@ -117,15 +117,15 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.4537 | 0.0009 | 1 | 1.3971 |
-| 1.1953 | 0.2503 | 271 | 1.1562 |
-| 1.1678 | 0.5007 | 542 | 1.1135 |
-| 1.1912 | 0.7510 | 813 | 1.1014 |
+| 1.1978 | 0.2503 | 271 | 1.1561 |
+| 1.1637 | 0.5007 | 542 | 1.1131 |
+| 1.1894 | 0.7510 | 813 | 1.1018 |
 
 
 ### Framework versions
 
 - PEFT 0.14.0
 - Transformers 4.47.1
-- Pytorch 2.3.1+cu121
-- Datasets 3.1.0
+- Pytorch 2.5.1+cu124
+- Datasets 3.2.0
 - Tokenizers 0.21.0
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:79b8de8c3045ce62cac19cefbf984c4d2db1a67dd6e142abdce6e355a504e44b
+oid sha256:8e60fa14c8637e48775bc40d495a321bde53978310acf0b5847bb380ec614a2c
 size 45169354
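The adapter_model.bin change above is a Git LFS pointer swap: the repository stores only a small pointer file, and when the tracked binary changes, the `oid sha256:` line changes while the pointer layout and (here) the byte size stay the same. A minimal sketch of how such a pointer is derived from a blob, assuming the LFS v1 pointer format shown in the diff (the `lfs_pointer` helper name is hypothetical):

```python
import hashlib

def lfs_pointer(blob: bytes) -> str:
    """Build a Git LFS v1 pointer for a blob: version line, sha256 oid, byte size."""
    oid = hashlib.sha256(blob).hexdigest()
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{oid}\n"
        f"size {len(blob)}\n"
    )

# Any change to the blob content changes the oid, so a retrained adapter of the
# same size shows up in the diff as a single-line oid swap, as above.
pointer = lfs_pointer(b"example adapter weights")
print(pointer)
```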