-
Evaluating and Aligning CodeLLMs on Human Preference
Paper • 2412.05210 • Published • 47 -
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners
Paper • 2412.17256 • Published • 46 -
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper • 2412.21187 • Published • 40 -
HUNYUANPROVER: A Scalable Data Synthesis Framework and Guided Tree Search for Automated Theorem Proving
Paper • 2412.20735 • Published • 11
Yijie Chen
pppa
·
AI & ML interests
None yet
Recent Activity
liked
a dataset
2 days ago
HuggingFaceFW/fineweb-2
upvoted
a
paper
15 days ago
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse
Attention
upvoted
a
paper
about 1 month ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Organizations
Collections
4
models
None public yet
datasets
None public yet