Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model Paper • 2408.10764 • Published Aug 20, 2024 • 9
DataMan: Data Manager for Pre-training Large Language Models Paper • 2502.19363 • Published Feb 26 • 1