DFN Models + Data Collection CLIP Models trained using DFN-2B/DFN-5B datasets • 5 items • Updated 10 days ago • 10
TiC-CLIP Collection Benchmark for the design of efficient continual learning of image-text models over years. • 18 items • Updated 10 days ago • 4
MobileCLIP Models + DataCompDR Data Collection MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. • 22 items • Updated 9 days ago • 17
MS MARCO Mined Triplets Collection These datasets contain MS MARCO Triplets gathered by mining hard negatives using various models. Each dataset has various subsets. • 14 items • Updated May 21 • 6
Parallel Sentences Datasets Collection These datasets all have "english" and "non_english" columns for numerous datasets. They can be used to make embedding models multilingual. • 13 items • Updated 11 days ago • 4
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 66 items • Updated 8 days ago • 42
Qwen1.5 Collection Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 23 days ago • 198
Qwen2 Collection Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 29 items • Updated 23 days ago • 217
ImageInWords Release Collection arXiv: https://arxiv.org/abs/2405.02793 • 3 items • Updated 2 days ago • 1
IndicGenBench Collection Datasets released in "IndicGenBench: A Multilingual Benchmark to Evaluate Generation Capabilities of LLMs" (https://arxiv.org/abs/2404.16816) • 4 items • Updated 2 days ago • 3
PaliGemma Release Collection Pretrained and mix checkpoints for PaliGemma • 16 items • Updated 2 days ago • 118
Granite Speculators Collection A collection of accelerators for the Granite LLM/Code family of models • 4 items • Updated about 9 hours ago • 5
Granite Time Series Models Collection A collection of time series models trained by IBM licensed under Apache 2.0 license. • 4 items • Updated about 9 hours ago • 7
Granite Code Models Collection A series of code models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 20 items • Updated about 9 hours ago • 145
Aya Datasets Collection The Aya Collection is a massive multilingual collection for over 100 languages consisting of 513 million instances of prompts and completions. • 5 items • Updated 1 day ago • 9
C4AI Command R Collection C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 3 items • Updated May 23 • 12
C4AI Command R Plus Collection C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities. • 3 items • Updated May 23 • 20
C4AI Aya 23 Collection Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 3 items • Updated May 23 • 38
Embedding Models (English) Collection Various English language embedding models in GGUF format • 3 items • Updated Feb 20 • 2
ShareGPT Datasets Collection Datasets (not by me) that I converted to the ShareGPT format • 5 items • Updated Apr 29 • 1
Spaces of the Week Collection My spaces or spaces I worked featured on Spaces of the Week! Ones at the top are the oldest, newest at the bottom 🤗 • 6 items • Updated Apr 29 • 2
PhotoMaker Collection Let us create photos/paintings/avatars for anyone in any style within seconds. • 3 items • Updated about 20 hours ago • 19
T2I-Adapter-SDXL Collection The smallest and most efficient control models for SDXL! • 8 items • Updated Sep 8, 2023 • 24
Llama3-ChatQA-1.5 Collection Llama3-ChatQA-1.5 models excel at conversational question answering (QA) and retrieval-augmented generation (RAG). • 6 items • Updated 15 days ago • 37
RLHF Collection A collection of models trained with Reinforcement Learning from Human Feedback (RLHF). • 4 items • Updated 11 days ago • 3
OpenMath Collection A collection of models and datasets introduced in "OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset" • 15 items • Updated 15 days ago • 32
Canary Collection A collection of multilingual and multitask speech to text models from NVIDIA NeMo 🐤 • 1 item • Updated 15 days ago • 16
Parakeet Collection NeMo Parakeet ASR Models attain strong speech recognition accuracy while being efficient for inference. Available in CTC and RNN-Transducer variants. • 6 items • Updated 15 days ago • 15
SteerLM Collection A collection of models and datasets relating to SteerLM and HelpSteer. • 7 items • Updated 11 days ago • 11
Arctic-embed Collection A collection of text embedding models optimized for retrieval accuracy and efficiency • 5 items • Updated Apr 17 • 11
Arctic Collection A collection of pre-trained dense-MoE Hybrid transformer models • 2 items • Updated Apr 24 • 20
SpeechT5 Collection The SpeechT5 framework consists of a shared seq2seq and six modal-specific (speech/text) pre/post-nets that can address a few audio-related tasks. • 8 items • Updated May 22 • 14
TAPEX Collection TAPEX is the state-of-the-art table pre-training models which can be used for table-based question answering and table-based fact verification. • 10 items • Updated May 22 • 4
Table Transformer Collection The Table Transformer (TATR) is a series of object detection models useful for table extraction from PDF images. • 5 items • Updated May 22 • 15
LayoutLM Collection The LayoutLM series are Transformer encoders useful for document AI tasks such as invoice parsing, document image classification and DocVQA. • 5 items • Updated May 22 • 9
GIT Collection GIT (Generative Image-to-text Transformer) is a model useful for vision-language tasks such as image/video captioning and question answering. • 18 items • Updated May 22 • 5
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 22 items • Updated 29 days ago • 346