Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning • Paper • 2502.06060 • Published 10 days ago • 31
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents • Paper • 2306.16527 • Published Jun 21, 2023 • 47
theojolliffe/bart-large-cnn-pubmed1o3-pubmed2o3 • Text2Text Generation • Updated May 27, 2022 • 121 • 1