Active filters: orpo
shenzhi-wang/Llama3-8B-Chinese-Chat • Text Generation • 8B • Updated • 9.68k • 683
shenzhi-wang/Llama3.1-8B-Chinese-Chat • Text Generation • 8B • Updated • 24.5k • 263
v000000/Qwen2.5-Lumen-14B • Text Generation • 15B • Updated • 1.13k • 21
OpenBabylon/MamayLM-ORPO-align-lora256
nbeerbower/Denker-mistral-nemo-12B • Text Generation • 12B • Updated • 9 • 4
AmberYifan/Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-ORPO • Text Generation • 8B • Updated • 16 • 1
Rustamshry/Psychology-RLHF • Text Generation • Updated • 28 • 1
Rustamshry/Social-RLHF • Text Generation • Updated • 20 • 1
mradermacher/Denker-mistral-nemo-12B-GGUF • 12B • Updated • 257 • 1
mradermacher/Denker-mistral-nemo-12B-i1-GGUF • 12B • Updated • 541 • 1
mradermacher/Llama-3.1-8B-sft-SPIN-Llama-3.1-70B-Instruct-ORPO-GGUF • 8B • Updated • 2.31k • 1
alvarobartt/Mistral-7B-v0.1-ORPO • Text Generation • 7B • Updated • 5 • 14
alvarobartt/Mistral-7B-v0.1-ORPO-PEFT • Text Generation • Updated • 5 • 1
anakin87/gemma-2b-orpo • Text Generation • 3B • Updated • 452 • 28
alvarobartt/mistral-orpo-mix • Text Generation • 7B • Updated • 4 • 1
bartowski/Mistral-7B-v0.1-ORPO-exl2 • Text Generation • Updated
bartowski/Mistral-7B-v0.1-ORPO-GGUF • Text Generation • 7B • Updated • 85 • 1
alvarobartt/mistral-orpo-mix-b0.1-l2048-pl1792-lr5e-6-inverse-sqrt • Text Generation • 7B • Updated • 5
alvarobartt/mistral-orpo-mix-b0.05-l1024-pl512-lr5e-7-cosine • Text Generation • 7B • Updated • 4
alvarobartt/mistral-7b-orpo-alignment-handbook • Text Generation • 7B • Updated • 6
alvarobartt/mistral-7b-orpo-airoboros-pref-10k • Text Generation • 7B • Updated • 5
mradermacher/mistral-7b-orpo-capybara-reproduction-GGUF • 7B • Updated • 478
vain05/stablelm-2-1_6b-orpo-full-v1 • Text Generation • 2B • Updated • 4
anakin87/gemma-2b-orpo-GGUF • 3B • Updated • 2 • 7
vain05/stablelm-2-1_6b-orpo-full-v2 • Text Generation • 2B • Updated • 4
vain05/stablelm-2-1_6b-orpo-full-v3 • Text Generation • 2B • Updated • 5
HuggingFaceH4/zephyr-orpo-141b-A35b-v0.1 • Text Generation • 141B • Updated • 37 • 268
MaziyarPanahi/zephyr-orpo-141b-A35b-v0.1-GGUF • Text Generation • 141B • Updated • 126 • 29
mlx-community/zephyr-orpo-141b-A35b-v0.1-4bit • 22B • Updated • 7 • 2
wandb/zephyr-orpo-7b-v0.2 • Text Generation • 7B • Updated • 6 • 4