SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated about 1 hour ago • 29
JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation Paper • 2411.07975 • Published Nov 12 • 27
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 9 days ago • 130
Embedding Model Datasets Collection A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 67 items • Updated Jul 3 • 87
Molmo Collection Artifacts for open multimodal language models. • 5 items • Updated 25 days ago • 289
GaussianAnything: Interactive Point Cloud Latent Diffusion for 3D Generation Paper • 2411.08033 • Published Nov 12 • 22
VoxPopuli v2 Collection A collection of checkpoints from the second VoxPopuli release. • 35 items • Updated Jan 16 • 5
VoxPopuli Collection A collection of open-source artefacts (datasets + checkpoints) from the first VoxPopuli release. • 32 items • Updated Jan 16 • 4
Robust Wav2Vec 2.0 Collection A collection of "robust" Wav2Vec 2.0 checkpoints pre-trained on datasets from multiple domains. • 4 items • Updated Jan 16 • 3
XLSR Collection A collection of multilingual Wav2Vec 2.0 checkpoints pre-trained on 53 languages and fine-tuned for CTC speech recognition. • 12 items • Updated Jan 16 • 6
Wav2Vec 2.0 Collection A collection for the first release of Wav2Vec 2.0, a speech encoder that learns powerful representations from unlabelled audio data. • 8 items • Updated Jan 16 • 17
SeamlessM4T Collection SeamlessM4T is designed to provide high quality translation, allowing people from different linguistic communities to communicate effortlessly. • 9 items • Updated Jan 16 • 14
StyleStudio: Text-Driven Style Transfer with Selective Control of Style Elements Paper • 2412.08503 • Published 11 days ago • 8
Sana Collection ⚡️ Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 17 items • Updated 2 days ago • 56