Yu Yang's picture

3 7 2

Yu Yang

yuyangy

·

https://sites.google.com/g.ucla.edu/yuyang/home

AI & ML interests

None yet

Organizations

yuyangy's activity

upvoted a paper 2 months ago

Flow-DPO: Improving LLM Mathematical Reasoning through Online Multi-Agent Learning

Paper • 2410.22304 • Published Oct 29, 2024 • 17

upvoted 2 papers 3 months ago

SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models

Paper • 2403.07384 • Published Mar 12, 2024 • 1

SecCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

Paper • 2410.11096 • Published Oct 14, 2024 • 12

upvoted a paper 6 months ago

MIRAI: Evaluating LLM Agents for Event Forecasting

Paper • 2407.01231 • Published Jul 1, 2024 • 16

upvoted a paper 7 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Paper • 2404.16792 • Published Apr 25, 2024 • 11

upvoted a collection 7 months ago

Weak-to-Strong Extrapolation Expedites Alignment

Better aligned models obtained by weak-to-strong model extrapolation (ExPO) • 25 items • Updated 26 days ago • 17

upvoted a paper about 1 year ago

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2, 2024 • 64