Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22, 2024 β’ 69
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ 4 days ago β’ 16
Granite 3.1 Language Models Collection A series of language models with 128K context length trained by IBM licensed under Apache 2.0 license. β’ 8 items β’ Updated 16 days ago β’ 43
Spectrum: Targeted Training on Signal to Noise Ratio Paper β’ 2406.06623 β’ Published Jun 7, 2024 β’ 12
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb β’ Nov 28, 2024 β’ 127
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published 16 days ago β’ 116
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated 14 days ago β’ 111
view article Article Building a Local Vector Database Index with Annoy and Sentence Transformers By theeseus-ai β’ 28 days ago β’ 3
view article Article πΊπ¦ββ¬ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram β’ 29 days ago β’ 74
view article Article Accelerating Embedding & Reranking Models on AMD Using Infinity By michaelfeil β’ about 1 month ago β’ 4
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper β’ 2411.12946 β’ Published Nov 20, 2024 β’ 20
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 β’ Nov 21, 2024 β’ 35
Drowning in Documents: Consequences of Scaling Reranker Inference Paper β’ 2411.11767 β’ Published Nov 18, 2024 β’ 17
view article Article Halo: Open Source Health Tracking with Wearables By cyrilzakka β’ Nov 19, 2024 β’ 99
view article Article Releasing the largest multilingual open pretraining dataset By Pclanglais β’ Nov 13, 2024 β’ 98
Training with Prompts Collection See the Training with Prompts documentation for more details: https://sbert.net/examples/training/prompts/README.html β’ 5 items β’ Updated Nov 7, 2024 β’ 3
view article Article Releasing Common Corpus: the largest public domain dataset for training LLMs By Pclanglais β’ Mar 20, 2024 β’ 18
Model2Vec base models Collection These are the Minishlab Model2Vec base models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers β’ 7 items β’ Updated 19 days ago β’ 8