Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models Paper • 2411.06272 • Published 13 days ago • 3