view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 9 days ago β’ 36
Running on CPU Upgrade 12.2k π Open LLM Leaderboard Track, rank and evaluate open LLMs and chatbots