LayoutLM - a microsoft Collection

LayoutLM

updated 19 days ago

The LayoutLM series consists of Transformer encoders for document AI tasks such as invoice parsing, document image classification, and document visual question answering (DocVQA).

microsoft/layoutlmv3-base

Updated Apr 10, 2024 • 2.48M • 367

Note Currently the best LayoutLM model.
microsoft/layoutlmv2-base-uncased

Updated Sep 16, 2022 • 916k • 63
microsoft/layoutlm-base-uncased

Updated Apr 16, 2024 • 1.63M • 50
microsoft/layoutxlm-base

Updated Sep 16, 2022 • 37.1k • 70

Note A multilingual variant trained on 100 languages.
impira/layoutlm-document-qa

Document Question Answering • Updated Mar 18, 2023 • 53.1k • 1.07k

Note A LayoutLM (v1) model fine-tuned to perform question answering over documents (DocVQA).
nielsr/layoutlmv3-finetuned-funsd

Token Classification • Updated Sep 16, 2023 • 1.95k • 24

Note A LayoutLMv3 model fine-tuned on the FUNSD dataset, a benchmark for document parsing.
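
The models in this collection take word-level bounding boxes alongside the text, and the Hugging Face implementations expect those boxes on a 0-1000 grid rather than in pixels. The following is a minimal sketch of that preprocessing step, assuming boxes are given as pixel (x0, y0, x1, y1) tuples. The helper names `normalize_box` and `encode_page` are hypothetical and chosen for illustration; `encode_page` additionally assumes `transformers`, `torch`, and Pillow are installed and that you supply your own words and boxes (for example from an OCR engine).

```python
from typing import List, Sequence


def normalize_box(box: Sequence[float], page_width: float, page_height: float) -> List[int]:
    """Scale a pixel-space (x0, y0, x1, y1) box to the 0-1000 integer grid."""
    x0, y0, x1, y1 = box

    def scale(value: float, size: float) -> int:
        # Clamp so that slightly out-of-page OCR boxes stay inside the valid range.
        return max(0, min(1000, int(1000 * value / size)))

    return [
        scale(x0, page_width),
        scale(y0, page_height),
        scale(x1, page_width),
        scale(y1, page_height),
    ]


def encode_page(words, boxes, image, model_id="microsoft/layoutlmv3-base"):
    """Hypothetical helper: run one page through a LayoutLMv3 encoder.

    `image` is assumed to be a PIL image, `words` a list of strings, and
    `boxes` a list of pixel-space boxes aligned with `words`.
    """
    # Imported lazily so that normalize_box stays usable without these packages.
    from transformers import AutoModel, AutoProcessor

    # apply_ocr=False because the caller provides words and boxes.
    processor = AutoProcessor.from_pretrained(model_id, apply_ocr=False)
    model = AutoModel.from_pretrained(model_id)

    width, height = image.size
    normalized = [normalize_box(b, width, height) for b in boxes]
    encoding = processor(image, words, boxes=normalized, return_tensors="pt")
    return model(**encoding).last_hidden_state
```

For a fine-tuned checkpoint such as the token classification model above, the same preprocessing should apply, with the task-specific model class loaded in place of `AutoModel`.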