maxkretchmer/output

Generated from Trainer


maxkretchmer committed on Dec 17, 2023

Commit 8d19794 · 1 Parent(s): b57eca8

maxkretchmer/gc-mixtral

Files changed (1)

README.md +3 -3


@@ -3,7 +3,7 @@ license: apache-2.0
 library_name: peft
 tags:
 - generated_from_trainer
-base_model: upstage/SOLAR-10.7B-v1.0
+base_model: mistralai/Mixtral-8x7B-v0.1
 model-index:
 - name: output
   results: []
@@ -14,7 +14,7 @@ should probably proofread and complete it, then remove this comment. -->
 # output
-This model is a fine-tuned version of [upstage/SOLAR-10.7B-v1.0](https://huggingface.co/upstage/SOLAR-10.7B-v1.0) on the None dataset.
+This model is a fine-tuned version of [mistralai/Mixtral-8x7B-v0.1](https://huggingface.co/mistralai/Mixtral-8x7B-v0.1) on the None dataset.
 ## Model description
@@ -34,7 +34,7 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2.5e-05
-- train_batch_size: 4
+- train_batch_size: 1
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
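
For readers who want to reuse the values visible in the last hunk, here is a minimal sketch of how they might be expressed as keyword arguments for `transformers.TrainingArguments`. It is not taken from the author's training script. The mapping from the card's names to `per_device_train_batch_size`, `per_device_eval_batch_size`, `adam_beta1`, `adam_beta2`, and `adam_epsilon` is an assumption, as is the `output_dir` in the commented usage line. The hunk shows only part of the hyperparameter list, so settings outside it are not covered.

```python
# Hyperparameters as shown in the diff, after this commit
# (train_batch_size changed from 4 to 1; the rest are unchanged context lines).
# Assumption: the model card names map onto TrainingArguments keywords as below.
training_kwargs = {
    "learning_rate": 2.5e-05,
    "per_device_train_batch_size": 1,  # card: train_batch_size
    "per_device_eval_batch_size": 8,   # card: eval_batch_size
    "seed": 42,
    "adam_beta1": 0.9,                 # card: betas=(0.9, 0.999)
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-08,             # card: epsilon=1e-08
}

# Value of the one hyperparameter this commit changed, before the change.
previous_train_batch_size = 4

# Optional usage, if the transformers library is installed
# (output_dir is a hypothetical path, not taken from the commit):
# from transformers import TrainingArguments
# args = TrainingArguments(output_dir="output", **training_kwargs)
```

Keeping the values in a plain dictionary makes the change in this commit easy to check without loading any model weights.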