RL Fine-tuning Reasoning Collection: A collection of papers on using reinforcement learning to enhance reasoning • 2 items • Updated 2 days ago
RL Fine-tuning Tool Usage Collection: A collection of papers that use reinforcement learning to enhance tool usage and function calling • 2 items • Updated 2 days ago
Training Large Language Models to Reason in a Continuous Latent Space • Paper • 2412.06769 • Published 14 days ago • 57