cooleel
's Collections
DocAI
updated
Document Parsing Unveiled: Techniques, Challenges, and Prospects for
Structured Information Extraction
Paper
•
2410.21169
•
Published
•
30
LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via
Hybrid Architecture
Paper
•
2409.02889
•
Published
•
55
M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page
Multi-document Understanding
Paper
•
2411.04952
•
Published
•
28
Contextual Document Embeddings
Paper
•
2410.02525
•
Published
•
18
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with
End-to-End Sparse Sampling
Paper
•
2410.05970
•
Published
READoc: A Unified Benchmark for Realistic Document Structured Extraction
Paper
•
2409.05137
•
Published
Xmodel-1.5: An 1B-scale Multilingual LLM
Paper
•
2411.10083
•
Published
•
14
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding
And A Retrieval-Aware Tuning Framework
Paper
•
2411.06176
•
Published
•
45
CC1984/mall_receipt_extraction_dataset
Viewer
•
Updated
•
1.8k
•
35
•
1
VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal
Retrieval-Augmented Generation
Paper
•
2412.10704
•
Published
•
15