arxiv:2412.14161
Graham Neubig
gneubig
AI & ML interests
NLP
Recent Activity
updated a dataset 1 day ago: gneubig/aime-1983-2024
authored a paper 3 days ago: TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
upvoted a paper 4 days ago: TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks
Organizations
Papers: 20
Models: None public yet