Safetensors
llama
Edit model card

EVA Yi 1.5 9B v1

A RP/storywriting focused model, full-parameter finetune of Yi-1.5-9B-32K on mixture of synthetic and natural data.
A continuation of nothingiisreal's Celeste 1.x series, made to improve stability and versatility, without losing unique, diverse writing style of Celeste.

Quants: (GGUF is not recommended, lcpp breaks tokenizer fix)

We recommend using original BFloat16 weights, quantization seems to affect Yi significantly more than other model architectures.

Prompt format is ChatML.

Recommended sampler values:

  • Temperature: 1
  • Min-P: 0.05

Recommended SillyTavern presets (via CalamitousFelicitousness):


Training data:

  • Celeste 70B 0.1 data mixture minus Opus Instruct subset. See that model's card for details.
  • Kalomaze's Opus_Instruct_25k dataset, filtered for refusals.

Hardware used:

  • 4x3090Ti for 5 days.

Model was trained by Kearm and Auri.

Special thanks:

  • to Lemmy, Gryphe, Kalomaze and Nopm for the data
  • to ALK, Fizz and CalamitousFelicitousness for Yi tokenizer fix
  • and to InfermaticAI's community for their continued support for our endeavors
Downloads last month
11
Safetensors
Model size
8.83B params
Tensor type
BF16
·
Inference API
Unable to determine this model's library. Check the docs .

Model tree for EVA-UNIT-01/EVA-Yi-1.5-9B-32K-V1

Finetuned
(1)
this model
Quantizations
11 models

Datasets used to train EVA-UNIT-01/EVA-Yi-1.5-9B-32K-V1