End of training
Files changed: README.md (+9, -7); adapter_model.bin (+1, -1)
README.md (CHANGED)
@@ -3,11 +3,12 @@ library_name: peft
 license: llama3.2
 base_model: NousResearch/Llama-3.2-1B
 tags:
+- axolotl
 - generated_from_trainer
 datasets:
 - teknium/GPT4-LLM-Cleaned
 model-index:
-- name:
+- name: llama-fr-lora
 results: []
 ---
 
@@ -32,6 +33,7 @@ flash_attention: true
 gradient_accumulation_steps: 2
 gradient_checkpointing: true
 group_by_length: false
+hub_model_id: pandyamarut/llama-fr-lora
 learning_rate: 0.0002
 load_in_4bit: false
 load_in_8bit: false
@@ -56,7 +58,7 @@ optimizer: adamw_8bit
 output_dir: /runpod-volume/fine-tuning/test-run
 pad_to_sequence_len: true
 run_name: test-run
-runpod_job_id:
+runpod_job_id: b7693c20-f1ab-4572-ad4f-bf19fa790d82-u1
 sample_packing: true
 saves_per_epoch: 1
 sequence_len: 2048
@@ -76,11 +78,11 @@ weight_decay: 0
 
 </details><br>
 
-#
+# llama-fr-lora
 
 This model is a fine-tuned version of [NousResearch/Llama-3.2-1B](https://huggingface.co/NousResearch/Llama-3.2-1B) on the teknium/GPT4-LLM-Cleaned dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.
+- Loss: 1.1014
 
 ## Model description
 
@@ -115,9 +117,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.4537 | 0.0009 | 1 | 1.3971 |
-| 1.
-| 1.
-| 1.
+| 1.1953 | 0.2503 | 271 | 1.1562 |
+| 1.1678 | 0.5007 | 542 | 1.1135 |
+| 1.1912 | 0.7510 | 813 | 1.1014 |
 
 
 ### Framework versions
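The card above describes a LoRA adapter trained with axolotl and pushed to `pandyamarut/llama-fr-lora`. A minimal sketch of loading such an adapter for inference with the `peft` library (the repo ids come from `base_model` and `hub_model_id` in the config above; the prompt and generation settings are illustrative, not from the training run):

```python
# Sketch: load the LoRA adapter on top of its base model for inference.
import torch
from transformers import AutoTokenizer
from peft import AutoPeftModelForCausalLM

# AutoPeftModelForCausalLM reads adapter_config.json, fetches the base model
# (NousResearch/Llama-3.2-1B) and attaches the adapter weights to it.
model = AutoPeftModelForCausalLM.from_pretrained(
    "pandyamarut/llama-fr-lora",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("NousResearch/Llama-3.2-1B")

# Illustrative prompt; sequence_len/sample_packing above apply to training only.
prompt = "Explain gradient checkpointing in one sentence."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```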
adapter_model.bin (CHANGED)
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:79b8de8c3045ce62cac19cefbf984c4d2db1a67dd6e142abdce6e355a504e44b
 size 45169354
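The `adapter_model.bin` entry is a Git LFS pointer: the repository itself stores only the object's sha256 (`oid`) and byte size, while the 45 MB adapter lives in LFS storage. A minimal sketch of verifying a downloaded copy against that pointer, assuming the repo id from the card's `hub_model_id`:

```python
# Sketch: check a downloaded adapter_model.bin against the LFS pointer above.
import hashlib
import os
from huggingface_hub import hf_hub_download

# Values copied from the pointer file in this commit.
EXPECTED_OID = "79b8de8c3045ce62cac19cefbf984c4d2db1a67dd6e142abdce6e355a504e44b"
EXPECTED_SIZE = 45169354  # bytes

# hf_hub_download resolves the LFS pointer and returns a path to the real binary.
path = hf_hub_download(repo_id="pandyamarut/llama-fr-lora", filename="adapter_model.bin")

# The pointer's oid is the sha256 of the resolved file, so hashing it
# should reproduce the oid exactly.
h = hashlib.sha256()
with open(path, "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):  # hash in 1 MiB chunks
        h.update(chunk)

assert os.path.getsize(path) == EXPECTED_SIZE, "size mismatch"
assert h.hexdigest() == EXPECTED_OID, "sha256 mismatch"
print("adapter_model.bin matches the LFS pointer")
```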