ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 15 days ago • 111
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data • 8 items • Updated 16 days ago • 17
Smaller Language Models Are Better Instruction Evolvers Paper • 2412.11231 • Published 19 days ago • 26
Synthetic Data Generator Collection A collection of tools and datasets related to no-code the Synthetic Data Generation. • 16 items • Updated 7 minutes ago • 5
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published 30 days ago • 45
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 21 days ago • 121
view article Article Use Models from the Hugging Face Hub in LM Studio By yagilb • Nov 28, 2024 • 127
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation Paper • 2411.04997 • Published Nov 7, 2024 • 37
view article Article How to build a custom text classifier without days of human labeling By sdiazlor • Oct 17, 2024 • 55
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled • Oct 14, 2024 • 61
Enriching Music Descriptions Collection Dataset for Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval: https://ieeexplore.ieee.org/document/10446380 • 6 items • Updated Apr 22, 2024 • 2
view article Article Synthetic dataset generation techniques: Self-Instruct By davanstrien • May 15, 2024 • 14
Synchronize Dual Hands for Physics-Based Dexterous Guitar Playing Paper • 2409.16629 • Published Sep 25, 2024 • 10
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation Paper • 2407.11798 • Published Jul 16, 2024 • 1
Enhance Your Images Collection Some trending Gradio apps on Spaces that you can use to enhance/upscale your images for free. This collection will be kept uptodate with new releases. • 7 items • Updated Aug 22, 2024 • 17