The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale Paper • 2406.17557 • Published 1 day ago • 43
view article Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models 3 days ago • 88
view article Article Enhancing Image Model Dreambooth Training Through Effective Captioning: Key Observations By alvdansen • 7 days ago • 11
How Do Large Language Models Acquire Factual Knowledge During Pretraining? Paper • 2406.11813 • Published 9 days ago • 27
MobileCLIP Models + DataCompDR Data Collection MobileCLIP: Mobile-friendly image-text models with SOTA zero-shot capabilities. DataCompDR: Improved datasets for training image-text SOTA models. • 22 items • Updated 7 days ago • 17
Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Paper • 2403.03206 • Published Mar 5 • 47
Quantum Embedding with Transformer for High-dimensional Data Paper • 2402.12704 • Published Feb 20 • 2
INDUS: Effective and Efficient Language Models for Scientific Applications Paper • 2405.10725 • Published May 17 • 23
view article Article Multimodal Augmentation for Documents: Recovering “Comprehension” in “Reading and Comprehension” task By danaaubakirova • May 16 • 15
view article Article A Guide to Designing New Functional Proteins and Improving Protein Function, Stability, and Diversity with Generative AI By AmelieSchreiber • May 14 • 22
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Apr 22 • 75
view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • 3 days ago • 48
view article Article Leveraging Transformers and PyTorch for Multiple Choice Question Tasks By Andyrasika • Dec 25, 2023 • 1
view article Article Robust image watermarking with Stable Signature + IMATAG's BZH By imatag-vch • Jan 22 • 1
view article Article Serverless Image Similarity with Upstash Vector and Huggingface Models, Datasets and Spaces By omerXfaruq • Jan 31 • 2
view article Article Streamline Computer Vision Workflows with Hugging Face Transformers and FiftyOne By jamarks • Feb 27 • 7
view article Article Orchestration of Experts: The First-Principle Multi-Model System By alirezamsh • 27 days ago • 14
view article Article RAG Empowerment: Cohere C4AI Command-R and Transformers Unveiled By Andyrasika • Apr 7 • 10
view article Article DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive By bpan • Apr 9 • 28
view article Article From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease Oct 21, 2022 • 6
view article Article Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B Apr 4 • 22
view article Article Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval Mar 22 • 43