Cosmos Tokenizer Collection • A suite of image and video tokenizers • 12 items
BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents Paper • 2108.04539 • Published Aug 10, 2021
DenseNets Reloaded: Paradigm Shift Beyond ResNets and ViTs Paper • 2403.19588 • Published Mar 28, 2024
On Web-based Visual Corpus Construction for Visual Document Understanding Paper • 2211.03256 • Published Nov 7, 2022