Improving Visual Commonsense in Language Models via Multiple Image Generation Paper • 2406.13621 • Published 14 days ago • 13
Make It Count: Text-to-Image Generation with an Accurate Number of Objects Paper • 2406.10210 • Published 19 days ago • 74