Submitted by PhoenixZ 59 OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference · 13 authors 2
Submitted by akhaliq 46 SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution · 9 authors 4
Submitted by jt-zhang 45 SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference · 7 authors 2
Submitted by GlyphByT5 28 ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation · 17 authors 3
Submitted by xilluill 27 KV-Edit: Training-Free Image Editing for Precise Background Preservation · 4 authors 3
Submitted by Lucky2022 16 Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective · 5 authors 2
Submitted by AmberLJC 14 Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents · 10 authors 5
Submitted by Paper99 12 K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs · 3 authors 2
Submitted by Taoer 12 Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models · 6 authors 2
Submitted by rp-yu 10 Introducing Visual Perception Token into Multimodal Large Language Model · 3 authors 2
Submitted by Dominic789654 6 The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve? · 7 authors 2
Submitted by oceanpty 5 Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization · 7 authors 2
Submitted by jrzhang 3 MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs · 4 authors 2
Submitted by twigs 3 LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models · 2 authors 2
Submitted by SyedAbdul 3 Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI · 3 authors 2
Submitted by Kinpz 2 LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation · 8 authors 2
Submitted by ahmedselhady 1 WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging · 3 authors 2