1 75 21

js

rldy

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

upvoted a paper 3 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

upvoted a paper 4 days ago

Reasoning Language Models: A Blueprint

View all activity

Organizations

rldy's activity

upvoted 2 papers 3 days ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 4 days ago • 56

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 4 days ago • 190

upvoted a paper 4 days ago

Reasoning Language Models: A Blueprint

Paper • 2501.11223 • Published 7 days ago • 23

liked a model 6 days ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated about 5 hours ago • 131k • 2.75k

upvoted a paper 6 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 10 days ago • 97

upvoted 2 papers 9 days ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 10 days ago • 65

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 12 days ago • 47

upvoted a paper 10 days ago

MangaNinja: Line Art Colorization with Precise Reference Following

Paper • 2501.08332 • Published 12 days ago • 55

upvoted a paper 11 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published 12 days ago • 268

upvoted 2 papers 12 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 15 days ago • 74

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 15 days ago • 29

upvoted a collection 13 days ago

Reasoning Datasets

Collection

Reasoning datasets that are trending 🔥 • 10 items • Updated 23 days ago • 24

liked a model 15 days ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated 13 days ago • 11.9k • 512

upvoted 2 papers 17 days ago

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 18 days ago • 248

liked a model 24 days ago

katanemo/Arch-Function-3B

Text Generation • Updated Dec 2, 2024 • 1.32k • 94

upvoted 3 papers about 1 month ago

Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search

Paper • 2412.18319 • Published Dec 24, 2024 • 37

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Paper • 2407.21787 • Published Jul 31, 2024 • 12

Solving math word problems with process- and outcome-based feedback

Paper • 2211.14275 • Published Nov 25, 2022 • 8

liked a Space about 1 month ago

Running

485

📈

js

AI & ML interests

Recent Activity

Organizations

rldy's activity

Scaling test-time compute