js

rldy

AI & ML interests

None yet

Recent Activity

liked a Space 2 days ago

smolagents/smolagents-leaderboard

liked a dataset 10 days ago

GeneralReasoning/GeneralThought-195K

upvoted a paper 13 days ago

Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation

Organizations

rldy's activity

liked a Space 2 days ago

smolagents LLM leaderboard

🏆

A leaderboard for LLMs powering smolagents

liked a dataset 10 days ago

GeneralReasoning/GeneralThought-195K

Viewer • Updated 3 days ago • 195k • 1.12k • 66

liked a dataset 17 days ago

SynthLabsAI/Big-Math-RL-Verified

Viewer • Updated 7 days ago • 251k • 5.36k • 149

liked a dataset 20 days ago

xlangai/AgentTrek

Viewer • Updated 22 days ago • 52.6k • 249 • 19

liked a dataset 21 days ago

bethgelab/CuratedThoughts

Viewer • Updated 15 days ago • 222k • 1.72k • 37

liked a model 21 days ago

microsoft/wham

Updated 21 days ago • 8.44k • 245

liked a dataset 21 days ago

SakanaAI/AI-CUDA-Engineer-Archive

Viewer • Updated 22 days ago • 30.6k • 14.2k • 138

liked a dataset 22 days ago

facebook/natural_reasoning

Viewer • Updated 21 days ago • 1.15M • 10.7k • 398

liked a Space 22 days ago

2.24k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a Space 26 days ago

237

Agent Leaderboard

💬

Ranking of LLMs for agentic tasks

liked a model 28 days ago

NousResearch/DeepHermes-3-Llama-3-8B-Preview

Text Generation • Updated about 5 hours ago • 17.4k • 294

liked a dataset 29 days ago

open-r1/OpenR1-Math-Raw

Viewer • Updated 17 days ago • 516k • 2.19k • 72

liked 2 datasets about 1 month ago

open-r1/OpenR1-Math-220k

Viewer • Updated 23 days ago • 450k • 53k • 491

nebius/SWE-agent-trajectories

Viewer • Updated Dec 23, 2024 • 80k • 463 • 52

liked a Space about 1 month ago

CoT-Lab: Human-AI Co-Thinking Laboratory

🤖

liked a model about 2 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated 18 days ago • 2.75M • 11.3k

liked 2 models 2 months ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated Jan 13 • 4.42k • 540

katanemo/Arch-Function-3B

Text Generation • Updated Feb 5 • 1.31k • 111

liked a Space 3 months ago

534

Scaling test-time compute

📈

Enhance math problem solving by scaling test-time compute

liked a dataset 3 months ago

HuggingFaceTB/finemath

Viewer • Updated Feb 6 • 48.3M • 10.6k • 292