Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models Paper • 2411.06272 • Published 15 days ago • 3