7 171 9

Robin Williams PRO

bfuzzy1

AI & ML interests

None yet

Recent Activity

updated a collection 2 days ago

Nifty

upvoted a paper 2 days ago

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

updated a collection 2 days ago

Nifty

View all activity

Organizations

None yet

bfuzzy1's activity

upvoted a paper 2 days ago

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

Paper • 2502.09390 • Published 5 days ago • 14

upvoted a collection 4 days ago

Tools for learning AI

Collection

This is a collection of tools on the hub that teachers and students can use to learn AI! • 9 items • Updated about 19 hours ago • 55

upvoted a paper 4 days ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published 14 days ago • 57

upvoted an article 5 days ago

Article

1 Billion Classifications

5 days ago

• 37

upvoted a paper 7 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published 10 days ago • 106

upvoted 3 papers 9 days ago

upvoted 4 papers 10 days ago

Large Language Model Guided Self-Debugging Code Generation

Paper • 2502.02928 • Published 13 days ago • 10

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published 13 days ago • 51

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published 13 days ago • 53

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 13 days ago • 179

upvoted a paper 18 days ago

Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate

Paper • 2501.17703 • Published 20 days ago • 54

upvoted a paper 20 days ago

Towards General-Purpose Model-Free Reinforcement Learning

Paper • 2501.16142 • Published 22 days ago • 26

upvoted 2 papers 21 days ago

RL + Transformer = A General-Purpose Problem Solver

Paper • 2501.14176 • Published 25 days ago • 24

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 26 days ago • 63

upvoted a paper 23 days ago

Control LLM: Controlled Evolution for Intelligence Retention in LLM

Paper • 2501.10979 • Published 30 days ago • 6

upvoted 3 papers 25 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 27 days ago • 24

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published 27 days ago • 93

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 27 days ago • 321