EDIT: Works pretty well for a model with no finetuning, has promise. Better and lighter than the 14b.

A 13b Mistral base model, based on the NeverSleep recipe. We've had second Mistral, why not third Mistral?

slices

merge_method: passthrough

dtype: bfloat16

Safetensors

Model size

13B params

Tensor type

BF16

Model tree for athirdpath/BigMistral-13b

Adapters