Article: Illustrating Reinforcement Learning from Human Feedback (RLHF), Dec 9, 2022
Paper: Llama 2: Open Foundation and Fine-Tuned Chat Models, arXiv:2307.09288, published Jul 18, 2023