GLiNER Collection Knowledgator GLiNER models for information extraction • 8 items • Updated 27 days ago • 9
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 23 days ago • 136
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais • Nov 13, 2024 • 98
M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding Paper • 2411.04952 • Published Nov 7, 2024 • 28
view article Article ColFlor: Towards BERT-Size Vision-Language Document Retrieval Models By ahmed-masry • Oct 18, 2024 • 16