SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation Paper • 2410.13293 • Published Oct 17, 2024 • 2
SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation Paper • 2410.13293 • Published Oct 17, 2024 • 2 • 2
OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework Paper • 2405.11143 • Published May 20, 2024 • 34
ReProHRL: Towards Multi-Goal Navigation in the Real World using Hierarchical Agents Paper • 2308.08737 • Published Aug 17, 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward Model Paper • 2305.18290 • Published May 29, 2023 • 50
view article Article Preference Tuning LLMs with Direct Preference Optimization Methods Jan 18, 2024 • 40