MARS: Unleashing the Power of Variance Reduction for Training Large Models Paper • 2411.10438 • Published Nov 15 • 13
Accelerated Preference Optimization for Large Language Model Alignment Paper • 2410.06293 • Published Oct 8 • 4
LLaVA-Critic Collection • as a general evaluator for assessing model performance • 6 items • Updated Oct 6 • 8
General Preference Modeling with Preference Representations for Aligning Language Models Paper • 2410.02197 • Published Oct 3 • 8
ProteinBench: A Holistic Evaluation of Protein Foundation Models Paper • 2409.06744 • Published Sep 10 • 7
Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published May 1 • 24
Understanding Transferable Representation Learning and Zero-shot Transfer in CLIP Paper • 2310.00927 • Published Oct 2, 2023 • 1
Rephrase and Respond: Let Large Language Models Ask Better Questions for Themselves Paper • 2311.04205 • Published Nov 7, 2023 • 5
Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment Paper • 2308.05374 • Published Aug 10, 2023 • 27
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models Paper • 2401.01335 • Published Jan 2 • 64