ORPO: Monolithic Preference Optimization without Reference Model Paper • 2403.07691 • Published Mar 12 • 62
ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models Paper • 2404.07738 • Published Apr 11 • 2
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 118