Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress Paper • 2408.14960 • Published Aug 27
RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs Paper • 2407.02552 • Published Jul 2 • 4
The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm Paper • 2406.18682 • Published Jun 26
Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning Paper • 2410.10801 • Published Oct 14