---
license: gemma
library_name: peft
tags:
- trl
- reward-trainer
- generated_from_trainer
base_model: google/gemma-2b
metrics:
- accuracy
model-index:
- name: RM-HH-AllMix_harmless_gpt3_20000_gemma2b_shuffleTrue_extractchosenTrue
  results: []
---
This model is a fine-tuned version of google/gemma-2b; the training dataset was not recorded in the card metadata. It achieves the following results on the evaluation set:
More information needed
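
Since the metadata lists accuracy as the metric for a reward model, the sketch below shows one common way that number is computed for pairwise preference data: the fraction of pairs in which the chosen response receives a higher reward than the rejected one. This is an illustration under assumptions, not the evaluation code used for this model. The function name `pairwise_accuracy` and the sample scores are hypothetical, and the treatment of ties (counted as incorrect here) may differ from what the training library does.

```python
from typing import Sequence


def pairwise_accuracy(
    chosen_rewards: Sequence[float], rejected_rewards: Sequence[float]
) -> float:
    """Fraction of preference pairs where the chosen response scores strictly higher.

    Hypothetical helper for illustration; ties count as incorrect (an assumption).
    """
    if len(chosen_rewards) != len(rejected_rewards):
        raise ValueError("expected one rejected reward per chosen reward")
    if not chosen_rewards:
        raise ValueError("need at least one preference pair")
    # A pair is counted as correct when the reward model ranks "chosen" above "rejected".
    correct = sum(c > r for c, r in zip(chosen_rewards, rejected_rewards))
    return correct / len(chosen_rewards)


# Illustrative scores for three preference pairs (made-up values, not results from this model).
chosen = [1.2, 0.3, -0.5]
rejected = [0.4, 0.9, -1.1]
print(pairwise_accuracy(chosen, rejected))
```

In practice the two score lists would come from running the reward model on the chosen and rejected completions of each evaluation pair.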
The following hyperparameters were used during training: