Text Generation

QuantFactory/Mistral-7B-Instruct-RDPO-GGUF

This is a quantized version of princeton-nlp/Mistral-7B-Instruct-RDPO, created using llama.cpp.

Model Description

This model was released with the preprint "SimPO: Simple Preference Optimization with a Reference-Free Reward". Please refer to our repository for more details.

Downloads last month: 172

GGUF

Model size: 7B params
Architecture: llama
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
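
To get a feel for what the listed bit widths mean in practice, the sketch below estimates a weight-only size for each one from the parameter count on this card. It is a back-of-the-envelope calculation, not a measurement of the files in this repository: the helper name `approx_weight_size_gb` is hypothetical, the "7B" figure is taken as exactly 7e9 parameters (an assumption), and real GGUF files are usually somewhat larger because llama.cpp quantization formats also store scaling data and file metadata.

```python
def approx_weight_size_gb(n_params: float, bits_per_weight: float) -> float:
    """Rough weight-only size in gigabytes (1 GB = 10^9 bytes).

    Ignores quantization scales, higher-precision tensors, and file
    metadata, so treat the result as a lower-bound estimate.
    """
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: "7B params" on the card is treated as exactly 7e9 parameters.
N_PARAMS = 7e9

if __name__ == "__main__":
    for bits in (2, 3, 4, 5, 6, 8):
        size = approx_weight_size_gb(N_PARAMS, bits)
        print(f"{bits}-bit: about {size:.2f} GB of weights")
```

Comparing the estimate against available RAM or VRAM is a quick first filter when picking a quantization; the actual file sizes in the repository are the authoritative numbers.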

Model tree for QuantFactory/Mistral-7B-Instruct-RDPO-GGUF

Base model: princeton-nlp/Mistral-7B-Instruct-RDPO
Quantized (2): this model

Collection including QuantFactory/Mistral-7B-Instruct-RDPO-GGUF

Mistral-AI

Quantized versions of models by mistralai • 19 items • Updated Oct 3, 2024

Paper for QuantFactory/Mistral-7B-Instruct-RDPO-GGUF

SimPO: Simple Preference Optimization with a Reference-Free Reward

Paper • 2405.14734 • Published May 23, 2024