Papers
arxiv:2411.14405

Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions

Published on Nov 21
· Submitted by akhaliq on Nov 22
#2 Paper of the day
Authors:
,
,
,
,
,
,
,
,

Abstract

Currently OpenAI o1 has sparked a surge of interest in the study of large reasoning models (LRM). Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mathematics, physics, and coding -- which are well-suited for reinforcement learning (RL) -- but also places greater emphasis on open-ended resolutions. We aim to address the question: "Can the o1 model effectively generalize to broader domains where clear standards are absent and rewards are challenging to quantify?" Marco-o1 is powered by Chain-of-Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), reflection mechanisms, and innovative reasoning strategies -- optimized for complex real-world problem-solving tasks.

Community

Paper submitter

Sign up or log in to comment

Models citing this paper 2

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2411.14405 in a dataset README.md to link it from this page.

Spaces citing this paper 2

Collections including this paper 1