Wei Fu

garrett4wade

AI & ML interests

RL

Recent Activity

updated a dataset 15 days ago

inclusionAI/AReaL-RL-Data

authored a paper 10 months ago

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

authored a paper 10 months ago

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

garrett4wade's activity

updated a dataset 15 days ago

inclusionAI/AReaL-RL-Data

Updated 15 days ago • 141

authored 2 papers 10 months ago

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores

Paper • 2306.16688 • Published Jun 29, 2023

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

Paper • 2404.10719 • Published Apr 16, 2024 • 5