I am a Strange Dataset: Metalinguistic Tests for Language Models Paper • 2401.05300 • Published Jan 10 • 4
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality Paper • 2204.03162 • Published Apr 7, 2022 • 1
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language Paper • 2306.16410 • Published Jun 28, 2023 • 28
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants Paper • 2112.09062 • Published Dec 16, 2021
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks Paper • 2204.01906 • Published Apr 5, 2022
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset Paper • 2303.03915 • Published Mar 7, 2023 • 6
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 27
olm/olm-CC-MAIN-2022-49-sampling-ratio-olm-0.15114822547 Viewer • Updated Feb 5, 2023 • 17.1M • 328 • 3