arxiv:2409.17066
YangWang92
yangwang92
AI & ML interests
None yet
Recent Activity
liked
a model
about 16 hours ago
ezelikman/quietstar-8-ahead
upvoted
a
paper
about 19 hours ago
Kimi k1.5: Scaling Reinforcement Learning with LLMs
upvoted
a
paper
about 19 hours ago
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via
Reinforcement Learning
Organizations
Papers
1
models
None public yet
datasets
None public yet