OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Paper • 2412.07626 • Published 24 days ago • 21
Cosmos Tokenizer Collection A suite of image and video tokenizers • 12 items • Updated 17 days ago • 28
Article • BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡ • By xhluca • Jul 9, 2024 • 41
AMD-OLMo Collection AMD-OLMo are a series of 1 billion parameter language models trained by AMD on AMD Instinct™ MI250 GPUs based on OLMo. • 4 items • Updated Oct 31, 2024 • 17
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 15 items • Updated 12 days ago • 196
C4AI Aya Expanse Collection Aya Expanse is an open-weight research release of a model with highly advanced multilingual capabilities. • 3 items • Updated 18 days ago • 30
Article • Training and Finetuning Embedding Models with Sentence Transformers v3 • May 28, 2024 • 167
Awesome Document AI Collection A collection of open-source document AI • 27 items • Updated Mar 11, 2024 • 76
VisFocus: Prompt-Guided Vision Encoders for OCR-Free Dense Document Understanding Paper • 2407.12594 • Published Jul 17, 2024 • 19
Article • Llama can now see and run on your device - welcome Llama 3.2 • Sep 25, 2024 • 180
💻 Local SmolLMs Collection SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated 12 days ago • 47
MoMa: Efficient Early-Fusion Pre-training with Mixture of Modality-Aware Experts Paper • 2407.21770 • Published Jul 31, 2024 • 22
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published Aug 20, 2024 • 58
Llama-3.1 Quantization Collection Neural Magic quantized Llama-3.1 models • 22 items • Updated Nov 22, 2024 • 42
Article • ColPali: Efficient Document Retrieval with Vision Language Models • By manu • Jul 5, 2024 • 183
Article • A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes • Aug 17, 2022 • 66
Article • From cloud to developers: Hugging Face and Microsoft Deepen Collaboration • May 21, 2024 • 8