Sijun Tan

sijuntan

jeffreysijuntan

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

upvoted a paper about 1 month ago

Training Software Engineering Agents and Verifiers with SWE-Gym

liked a dataset 3 months ago

ScalerLab/JudgeBench

Organizations

sijuntan's activity

upvoted 2 papers about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 254

Training Software Engineering Agents and Verifiers with SWE-Gym

Paper • 2412.21139 • Published Dec 30, 2024 • 21

liked a dataset 3 months ago

ScalerLab/JudgeBench

Viewer • Updated Oct 9, 2024 • 620 • 261 • 4

updated a dataset 3 months ago

IAMJB/paper-central-pr

Viewer • Updated Oct 29, 2024 • 15 • 279

New activity in IAMJB/paper-central-pr 3 months ago

Add new entry for arXiv ID 2410.12784

#8 opened 3 months ago by sijuntan

authored a paper 4 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 44

liked a Space 4 months ago

JudgeBench Leaderboard

🏆

Display and filter leaderboard results for LLM judges

upvoted a paper 4 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 44

commented on a paper 4 months ago

JudgeBench: A Benchmark for Evaluating LLM-based Judges

Paper • 2410.12784 • Published Oct 16, 2024 • 44

upvoted a paper 4 months ago

LLoCO: Learning Long Contexts Offline

Paper • 2404.07979 • Published Apr 11, 2024 • 21

authored a paper 10 months ago

LLoCO: Learning Long Contexts Offline

Paper • 2404.07979 • Published Apr 11, 2024 • 21