macadeliccc
/

SOLAR-10.7b-Instruct-truthy-dpo

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

SOLAR-10.7b-Instruct-truthy-dpo / README.md

macadeliccc's picture

Update README.md

b03d05e verified 10 months ago

|

546 Bytes

	---
	library_name: transformers
	tags: []
	---
	# SOLAR-10.7b-Instruct-truthy-dpo

	![orca-bagel](orca-bagel.png)

	This model is a finetune of [macadeliccc/SOLAR-10.7b-Instruct-truthy-dpo](https://huggingface.co/macadeliccc/SOLAR-10.7b-Instruct-dpo)

	## Process

	1. I finetuned upstageai/Solar-10.7b-Instruct-v0.1 with 1 epoch of Intel/orca_dpo_pairs (12.4k samples)
	2. I futher finetuned that model with 3 epochs of jondurbin/truthy-dpo-v0.1 (1.04k samples)
	3. This process is experimental and the base model linked above is more tested at this time.