DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging Paper โข 2407.01470 โข Published Jul 1 โข 5