sedrickkeh committed on
Commit 6987970
1 Parent(s): 5cb5ba5

Model save

Files changed (2)
  1. README.md +3 -6
  2. trainer_log.jsonl +2 -2
README.md CHANGED
@@ -4,7 +4,6 @@ license: apache-2.0
 base_model: mistralai/Mistral-7B-v0.1
 tags:
 - llama-factory
-- full
 - generated_from_trainer
 model-index:
 - name: mistral_alpaca_sft_sample
@@ -16,9 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 # mistral_alpaca_sft_sample
 
-This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the llamafactory/alpaca_en dataset.
-It achieves the following results on the evaluation set:
-- Loss: 1.7878
+This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on an unknown dataset.
 
 ## Model description
 
@@ -49,13 +46,13 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_ratio: 0.1
-- training_steps: 2
+- training_steps: 3
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log | 0.0870 | 2 | 1.7878 |
+| No log | 0.1304 | 3 | 1.6965 |
 
 
 ### Framework versions
trainer_log.jsonl CHANGED
@@ -1,2 +1,2 @@
-{"current_steps": 2, "total_steps": 2, "eval_loss": 1.7878271341323853, "epoch": 0.08695652173913043, "percentage": 100.0, "elapsed_time": "0:00:30", "remaining_time": "0:00:00"}
-{"current_steps": 2, "total_steps": 2, "epoch": 0.08695652173913043, "percentage": 100.0, "elapsed_time": "0:00:30", "remaining_time": "0:00:00"}
+{"current_steps": 3, "total_steps": 3, "eval_loss": 1.696488618850708, "epoch": 0.13043478260869565, "percentage": 100.0, "elapsed_time": "0:02:09", "remaining_time": "0:00:00"}
+{"current_steps": 3, "total_steps": 3, "epoch": 0.13043478260869565, "percentage": 100.0, "elapsed_time": "0:21:34", "remaining_time": "0:00:00"}
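
The new `trainer_log.jsonl` entries can be sanity-checked against the README table with a short script. The following is a minimal sketch, not part of the commit: the helper names (`parse_log`, `steps_per_epoch`, `to_seconds`) are hypothetical, and the steps-per-epoch estimate assumes that `epoch` is simply `current_steps` divided by the number of optimizer steps in one epoch, which the log itself does not state.

```python
import json

# The two entries added to trainer_log.jsonl in this commit, copied verbatim.
LOG_LINES = [
    '{"current_steps": 3, "total_steps": 3, "eval_loss": 1.696488618850708, '
    '"epoch": 0.13043478260869565, "percentage": 100.0, '
    '"elapsed_time": "0:02:09", "remaining_time": "0:00:00"}',
    '{"current_steps": 3, "total_steps": 3, "epoch": 0.13043478260869565, '
    '"percentage": 100.0, "elapsed_time": "0:21:34", "remaining_time": "0:00:00"}',
]


def parse_log(lines):
    """Parse JSON Lines text into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in lines if line.strip()]


def steps_per_epoch(entry):
    """Estimate steps per epoch (assumption: epoch = current_steps / steps_per_epoch)."""
    return round(entry["current_steps"] / entry["epoch"])


def to_seconds(hms):
    """Convert an 'H:MM:SS' string, as used in elapsed_time, to seconds."""
    hours, minutes, seconds = (int(part) for part in hms.split(":"))
    return hours * 3600 + minutes * 60 + seconds


entries = parse_log(LOG_LINES)
eval_entries = [e for e in entries if "eval_loss" in e]

print(steps_per_epoch(entries[0]))                        # 23
print(round(eval_entries[0]["eval_loss"], 4))             # 1.6965
print([to_seconds(e["elapsed_time"]) for e in entries])   # [129, 1294]
```

Under that assumption, both the old entries (2 steps at epoch 0.0870) and the new ones (3 steps at epoch 0.1304) point to about 23 steps per epoch, and the rounded `eval_loss` matches the 1.6965 shown in the README's training results table.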