Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published Dec 20, 2024 • 38
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published Dec 20, 2024 • 38
Offline Reinforcement Learning for LLM Multi-Step Reasoning Paper • 2412.16145 • Published Dec 20, 2024 • 38 • 6
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 75 • 7
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 75
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 75
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 75 • 7
Reasoning with Language Model is Planning with World Model Paper • 2305.14992 • Published May 24, 2023 • 3
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings Paper • 2305.11554 • Published May 19, 2023 • 2
Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking Paper • 2406.05673 • Published Jun 9, 2024 • 3
Pandora: Towards General World Model with Natural Language Actions and Video States Paper • 2406.09455 • Published Jun 12, 2024 • 15
BertNet: Harvesting Knowledge Graphs with Arbitrary Relations from Pretrained Language Models Paper • 2206.14268 • Published Jun 28, 2022 • 1
Pandora: Towards General World Model with Natural Language Actions and Video States Paper • 2406.09455 • Published Jun 12, 2024 • 15
Flow of Reasoning: Efficient Training of LLM Policy with Divergent Thinking Paper • 2406.05673 • Published Jun 9, 2024 • 3
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings Paper • 2305.11554 • Published May 19, 2023 • 2
BertNet: Harvesting Knowledge Graphs with Arbitrary Relations from Pretrained Language Models Paper • 2206.14268 • Published Jun 28, 2022 • 1
Reasoning with Language Model is Planning with World Model Paper • 2305.14992 • Published May 24, 2023 • 3