PIQA: Reasoning about Physical Commonsense in Natural Language Paper • 1911.11641 • Published Nov 26, 2019 • 2
SocialIQA: Commonsense Reasoning about Social Interactions Paper • 1904.09728 • Published Apr 22, 2019 • 2
HellaSwag: Can a Machine Really Finish Your Sentence? Paper • 1905.07830 • Published May 19, 2019 • 4
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms Paper • 1905.13319 • Published May 30, 2019 • 2
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 118