Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
Libraries
Languages
Licenses
Other
1
Apps
llama.cpp
LM Studio
Jan
Draw Things
DiffusionBee
Jellybox
JoyFusion
LocalAI
vLLM
Ollama
MLX LM
Docker Model Runner
Lemonade
SGLang
Inference Providers
Select all
Groq
Novita
Cerebras
SambaNova
Nscale
fal
Hyperbolic
Together AI
Fireworks
Featherless AI
Zai
Replicate
Cohere
Scaleway
Public AI
OVHcloud AI Endpoints
HF Inference API
WaveSpeed
Misc
Reset Misc
alignment
Inference Endpoints
text-generation-inference
Eval Results (legacy)
text-embeddings-inference
4-bit precision
Merge
custom_code
8-bit precision
Mixture of Experts
Carbon Emissions
Eval Results
Apply filters
Models
1,031
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
alignment
Clear all
q-hisa/dpo-qwen-cot-lora-v5
Text Generation
•
Updated
1 day ago
•
7
takami2022/dpo-qwen3-4b-structured-v2_SFT_SystemPrompt
Text Generation
•
4B
•
Updated
1 day ago
•
27
moushi21/dpo-qwen-cot-merged18
Text Generation
•
4B
•
Updated
1 day ago
•
27
takami2022/dpo-qwen3-4b-structured-v3_SFT_SystemPrompt
Text Generation
•
4B
•
Updated
1 day ago
•
23
sfutenma/dpo-qwen3_4b-cot-merged_v260221-210728
Text Generation
•
4B
•
Updated
1 day ago
•
30
nyannto/dpo-qwen-cot-merged15
Text Generation
•
4B
•
Updated
1 day ago
•
23
sfutenma/dpo-qwen3_4b-cot-merged_v260221-223020
Text Generation
•
4B
•
Updated
1 day ago
•
27
moushi21/dpo-qwen-cot-merged19
Text Generation
•
4B
•
Updated
1 day ago
•
29
arata1/dpo-qwen-cot-e2-b05-1024
Text Generation
•
4B
•
Updated
1 day ago
•
28
kedumerikugame/dpo-qwen-cot-merged
Text Generation
•
4B
•
Updated
about 21 hours ago
•
22
shotalab/Qwen3-4B-Instruct-SFT-03-Merged-DPO-02
Text Generation
•
4B
•
Updated
1 day ago
•
30
biokrhr/dpo-qwen-cot-merged
Text Generation
•
4B
•
Updated
1 day ago
•
23
gimlet09/dpo-qwen-cot-merged-v5
Text Generation
•
Updated
1 day ago
Taichi11/sft_v7_dpo_v1_merged
Text Generation
•
4B
•
Updated
1 day ago
•
26
ryoto0175/dpo-qwen-cot-merged-v24
Text Generation
•
4B
•
Updated
about 23 hours ago
•
31
ryoto0175/dpo-qwen-cot-merged-v25
Text Generation
•
4B
•
Updated
about 22 hours ago
•
32
Orifusa/dpo-qwen-cot-merged_study11.5.1ya-cot
Text Generation
•
4B
•
Updated
about 20 hours ago
moushi21/dpo-qwen-cot-merged20
Text Generation
•
4B
•
Updated
about 19 hours ago
makotonlo/LLM2026_DPO_SFT19_v8
Text Generation
•
Updated
about 19 hours ago
ryoto0175/dpo-qwen-cot-merged-v26
Text Generation
•
4B
•
Updated
about 19 hours ago
Orifusa/dpo-qwen-cot-merged_study11.5.3ya
Text Generation
•
4B
•
Updated
about 17 hours ago
ryoto0175/dpo-qwen-cot-merged-v27
Text Generation
•
4B
•
Updated
about 17 hours ago
Hi-Satoh/adv_sft3J_dpo_merged
Text Generation
•
4B
•
Updated
about 11 hours ago
Taichi11/sft_v7_dpo_v2_merged
Text Generation
•
4B
•
Updated
about 15 hours ago
WassyO/qwen3-4b-instruct-dpo-from-sft-4096_u-10bei_v5
Text Generation
•
Updated
about 15 hours ago
Orifusa/dpo-qwen-cot-merged_study11.5.4ya
Text Generation
•
4B
•
Updated
about 15 hours ago
moushi21/dpo-qwen-cot-merged21
Text Generation
•
4B
•
Updated
about 14 hours ago
shinich001/dpo-qwen-cot-merged
Text Generation
•
4B
•
Updated
about 12 hours ago
ogwata/exp19-enhanced-dpo
Text Generation
•
4B
•
Updated
about 12 hours ago
q-hisa/dpo-qwen-cot-merged-v6
Text Generation
•
4B
•
Updated
about 11 hours ago
Previous
1
...
32
33
34
35
Next