Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 30 days ago • 45
Large Language Model-Brained GUI Agents: A Survey Paper • 2411.18279 • Published Nov 27, 2024 • 27
Web Agents with World Models: Learning and Leveraging Environment Dynamics in Web Navigation Paper • 2410.13232 • Published Oct 17, 2024 • 41
Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code Paper • 2409.19715 • Published Sep 29, 2024 • 9
DLI-Lab/step-wise-eval-description-with-refined-tao-raw-neg_actions Viewer • Updated Sep 21, 2024 • 102 • 30
DLI-Lab/step-wise-eval-additional-description-with-refined-tao Viewer • Updated Sep 19, 2024 • 33 • 30
DLI-Lab/step-wise-eval-small-description-with-refined-tao Viewer • Updated Sep 18, 2024 • 69 • 30