Tianhao Wu's picture

6 4 6

Tianhao Wu

ThWu

·

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

ThWu/gsm8k_formatted_small

View all activity

Organizations

ThWu's activity

upvoted a paper 5 months ago

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Paper • 2407.19594 • Published Jul 28 • 20

upvoted a paper 6 months ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17 • 6

upvoted 2 papers about 1 year ago

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 75

Pairwise Proximal Policy Optimization: Harnessing Relative Feedback for LLM Alignment

Paper • 2310.00212 • Published Sep 30, 2023 • 2