Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 30 days ago • 45
Large Language Model-Brained GUI Agents: A Survey Paper • 2411.18279 • Published Nov 27, 2024 • 27
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published Oct 17, 2024 • 41
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code Paper • 2409.19715 • Published Sep 29, 2024 • 9
VerifiNER: Verification-augmented NER via Knowledge-grounded Reasoning with Large Language Models Paper • 2402.18374 • Published Feb 28, 2024 • 2
Coffee: Boost Your Code LLMs by Fixing Bugs with Feedback Paper • 2311.07215 • Published Nov 13, 2023 • 3
Dialogue Chain-of-Thought Distillation for Commonsense-aware Conversational Agents Paper • 2310.09343 • Published Oct 13, 2023 • 2