view article Article Crowd-sourced Open Preference Dataset for Text-to-Image Generation By RapidataAI β’ 11 days ago β’ 17
view article Article Synthetic Data Generation with FastData and Hugging Face By asoria β’ 10 days ago β’ 13
view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 15 days ago β’ 37
view article Article β΄οΈ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use By Ziyang β’ 15 days ago β’ 12
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 β’ 15 days ago β’ 30
view article Article Superposition in Transformers: A Novel Way of Building Mixture of Experts By BenChaliah β’ 14 days ago β’ 14
view article Article AI in 2025: A Combinatorial Explosion of Possibilities, but NOT AGI By Kseniase β’ 14 days ago β’ 3
view article Article Building Effective Agents with Anthropicβs Best Practices and smolagents β€οΈ By Sri-Vigneshwar-DJ β’ 13 days ago β’ 4
view article Article **Fine-tune SmolLM's on custom synthetic data** By prithivMLmods β’ 12 days ago β’ 16
view article Article How to Automate Reddit Comment Generation with AI Agents in KaibanJS By darielnoel β’ 11 days ago β’ 4
view article Article Announcing NVIDIA Cosmos World Foundation Models By mingyuliutw β’ 11 days ago β’ 22
view article Article Accelerating Language Model Inference with Mixture of Attentions By hba123 β’ 11 days ago β’ 24
Datasets: A Community Library for Natural Language Processing Paper β’ 2109.02846 β’ Published Sep 7, 2021 β’ 11
view article Article π¦Έπ»#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows By Kseniase β’ 21 days ago β’ 9
view article Article Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 β’ 19 days ago β’ 23
view article Article Finetuning Falcon 7b in a hybrid distributed fashion By Neo111x β’ 18 days ago β’ 4
DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought Paper β’ 2412.17498 β’ Published 26 days ago β’ 21