LMDX: Language Model-based Document Information Extraction and Localization • Paper • 2309.10952 • Published Sep 19, 2023 • 65
DocLLM: A layout-aware generative language model for multimodal document understanding • Paper • 2401.00908 • Published Dec 31, 2023 • 181
google-bert/bert-large-uncased-whole-word-masking-finetuned-squad • Question Answering • Updated Feb 19, 2024 • 186k • 173
openai/clip-vit-large-patch14 • Zero-Shot Image Classification • Updated Sep 15, 2023 • 42.5M • 1.59k
google-research-datasets/conceptual_captions • Viewer • Updated Jun 17, 2024 • 5.34M • 13k • 90