view article Article Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO) By ariG23498 • 11 days ago • 13
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 8 days ago • 29
Kendamarron/roleplay-multiturn-calm3-chat-format Viewer • Updated Sep 10, 2024 • 3.31k • 39 • 2