How FaR Are Large Language Models From Agents with Theory-of-Mind? Paper • 2310.03051 • Published Oct 4, 2023 • 34
RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs Paper • 2305.08844 • Published May 15, 2023 • 1