GritLM is a generative representational instruction tuned language model. It unifies text representation (embedding) and text generation into a single model achieving state-of-the-art performance on both types of tasks.

Layers	Context	Template (Text Representation)	Template (Text Generation)
32	32768	<s><\|user\|> {instruction} <\|embed\|> {sample}	<s><\|user\|> {prompt} <\|assistant\|> {response}

Downloads last month: 19

GGUF

Model size

7.24B params

Architecture

llama

4-bit

5-bit

6-bit

8-bit

16-bit

Inference Providers NEW

Text Generation

This model is not currently available via any of the supported third-party Inference Providers, and the HF Inference API does not support gguf models with pipeline type text-generation