view article Article πΊπ¦ββ¬ LLM Comparison/Test: DeepSeek-V3, QVQ-72B-Preview, Falcon3 10B, Llama 3.3 70B, Nemotron 70B in my updated MMLU-Pro CS benchmark By wolfram β’ 7 days ago β’ 34
Adding Conditional Control to Text-to-Image Diffusion Models Paper β’ 2302.05543 β’ Published Feb 10, 2023 β’ 45
Training Large Language Models to Reason in a Continuous Latent Space Paper β’ 2412.06769 β’ Published about 1 month ago β’ 69
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling Paper β’ 2412.05271 β’ Published Dec 6, 2024 β’ 124
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper β’ 2411.16489 β’ Published Nov 25, 2024 β’ 41
view article Article ColPali: Efficient Document Retrieval with Vision Language Models π By manu β’ Jul 5, 2024 β’ 185
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper β’ 2406.08464 β’ Published Jun 12, 2024 β’ 66
view article Article Falcon 2: An 11B parameter pretrained language model and VLM, trained on over 5000B tokens tokens and 11 languages May 24, 2024 β’ 25
view article Article Hugging Face x LangChain : A new partner package in LangChain May 14, 2024 β’ 115
view article Article PaliGemma β Google's Cutting-Edge Open Vision Language Model May 14, 2024 β’ 232
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model Paper β’ 2405.04434 β’ Published May 7, 2024 β’ 14
Meta Llama 3 Collection This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases β’ 5 items β’ Updated Dec 6, 2024 β’ 699
Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity Paper β’ 2403.14403 β’ Published Mar 21, 2024 β’ 6
Evolutionary Optimization of Model Merging Recipes Paper β’ 2403.13187 β’ Published Mar 19, 2024 β’ 50
MM-LLMs: Recent Advances in MultiModal Large Language Models Paper β’ 2401.13601 β’ Published Jan 24, 2024 β’ 45