Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2501.04227

about 2 hours ago

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

Paper • 2501.10799 • Published 8 days ago • 11
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 18 days ago • 248
Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81

Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback

Paper • 2501.03916 • Published 19 days ago • 14
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 18 days ago • 89
Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81
Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 17 days ago • 80

Structured Agent Operations

Agents for self-driving laboratories applied to quantum computing

Paper • 2412.07978 • Published Dec 10, 2024 • 1
Towards Scientific Discovery with Generative AI: Progress, Opportunities, and Challenges

Paper • 2412.11427 • Published Dec 16, 2024 • 1
AEGIS: An Agent-based Framework for General Bug Reproduction from Issue Descriptions

Paper • 2411.18015 • Published Nov 27, 2024 • 1
LLM4SR: A Survey on Large Language Models for Scientific Research

Paper • 2501.04306 • Published 18 days ago • 33

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and Reflection

Paper • 2501.04575 • Published 18 days ago • 23

Papers exploring autonomous AI systems and frameworks for building intelligent agents that can perceive environment, plan actions and use tools.

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81
Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

Paper • 2501.05707 • Published 16 days ago • 19
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 6 days ago • 74

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81
Search-o1: Agentic Search-Enhanced Large Reasoning Models

Paper • 2501.05366 • Published 17 days ago • 80
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published 6 days ago • 74
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments

Paper • 2501.10893 • Published 8 days ago • 22

Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81

Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 18 days ago • 89
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 18 days ago • 248
Agent Laboratory: Using LLM Agents as Research Assistants

Paper • 2501.04227 • Published 19 days ago • 81

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published 18 days ago • 248
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though

Paper • 2501.04682 • Published 18 days ago • 89
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3, 2024 • 54
Think Before You Speak: Cultivating Communication Skills of Large Language Models via Inner Monologue

Paper • 2311.07445 • Published Nov 13, 2023

Previous
1
2
3
4
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs