Mistral-Small-3.2-AntiRep-24B:

  • Exactly what it says on the tin, Orpo'd Mistral Small 3.2 to remove repetition.
  • Trained to reduce infinite repetition, repetition of structure and sentences in multi turn conversation, and repetition within responses.
  • Got really annoyed with all of my Mistral Small test models having repetition issues, so I decided to whip this up.
  • Produced by doing orpo with Qwen 3 8B at 0 temp + .7 rep pen (<1 increases repetition) as rejected vs V3 03/24 as chosen.
  • The LoRA is also available too, if you want to use it to reduce repetition on other MS3.2 tunes.

Enjoy!

Downloads last month
2
Safetensors
Model size
24B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ConicCat/Mistral-Small-3.2-AntiRep-24B

Finetuned
(61)
this model
Finetunes
2 models
Merges
1 model
Quantizations
5 models

Dataset used to train ConicCat/Mistral-Small-3.2-AntiRep-24B

Collection including ConicCat/Mistral-Small-3.2-AntiRep-24B