ldahee's picture
upload model14
b1644c8
metadata
license: apache-2.0
language:
  - en

4season/model_eval_test14

Introduction

This model is test version, alignment-tuned model.

We utilize state-of-the-art instruction fine-tuning methods including direct preference optimization (DPO).