ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 2 days ago • 72
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Paper • 2412.07626 • Published 11 days ago • 20
LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding Paper • 2410.17434 • Published Oct 22 • 25
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper • 2412.03555 • Published 17 days ago • 118
Multimodal Autoregressive Pre-training of Large Vision Encoders Paper • 2411.14402 • Published 30 days ago • 41
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation Paper • 2409.12941 • Published Sep 19 • 23
ColPali: Efficient Document Retrieval with Vision Language Models 👀 Article • By manu • Jul 5 • 180
Image Similarity with Hugging Face Datasets and Transformers Article • Jan 16, 2023 • 21
Searching for Best Practices in Retrieval-Augmented Generation Paper • 2407.01219 • Published Jul 1 • 11
Many-Shot In-Context Learning in Multimodal Foundation Models Paper • 2405.09798 • Published May 16 • 26
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper • 2405.01535 • Published May 2 • 119
Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models Paper • 2404.18796 • Published Apr 29 • 68
Revisiting Text-to-Image Evaluation with Gecko: On Metrics, Prompts, and Human Ratings Paper • 2404.16820 • Published Apr 25 • 15
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2 • 44
TnT-LLM: Text Mining at Scale with Large Language Models Paper • 2403.12173 • Published Mar 18 • 19