Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • about 1 hour ago
✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use By Ziyang • about 2 hours ago • 4
🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • about 21 hours ago • 19
Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 4 days ago • 17
🦸🏻#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows By Kseniase • 6 days ago • 8
Unlocking the Power of Reasoning: Introducing CriticalThinker-LLaMA-3.1-8B-GGUF and Its Groundbreaking Dataset By theeseus-ai • 7 days ago
🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI? By Kseniase • 9 days ago • 5
**Intelligence Potentiation: An Evolutionary Perspective on AI Agent Designs** By KnutJaegersberg • 15 days ago • 3
SILMA RAGQA V1.0: A Comprehensive Benchmark for Evaluating LLMs on RAG QA Use-Cases By karimouda • 16 days ago • 1
Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • about 1 hour ago
✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use By Ziyang • about 2 hours ago • 4
🐺🐦⬛ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram • about 21 hours ago • 19
Fine-tune ModernBERT for text classification using synthetic data By davidberenstein1957 • 4 days ago • 17
🦸🏻#2: Your Go-To Vocabulary to Navigate the World of AI Agents and Agentic Workflows By Kseniase • 6 days ago • 8
Unlocking the Power of Reasoning: Introducing CriticalThinker-LLaMA-3.1-8B-GGUF and Its Groundbreaking Dataset By theeseus-ai • 7 days ago
🦸🏻#1: Open-endedness and AI Agents – A Path from Generative to Creative AI? By Kseniase • 9 days ago • 5
**Intelligence Potentiation: An Evolutionary Perspective on AI Agent Designs** By KnutJaegersberg • 15 days ago • 3
SILMA RAGQA V1.0: A Comprehensive Benchmark for Evaluating LLMs on RAG QA Use-Cases By karimouda • 16 days ago • 1