new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Feb 26

Submitted by

PhoenixZ

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

·
13 authors

Submitted by

akhaliq

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

·
9 authors

Submitted by

jt-zhang

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

·
7 authors

Submitted by

GlyphByT5

ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation

·
17 authors

Submitted by

xilluill

KV-Edit: Training-Free Image Editing for Precise Background Preservation

·
4 authors

Submitted by

Lucky2022

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

·
5 authors

Submitted by

AmberLJC

Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents

·
10 authors

Submitted by

Paper99

K-LoRA: Unlocking Training-Free Fusion of Any Subject and Style LoRAs

·
3 authors

Submitted by

Taoer

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

·
6 authors

Submitted by

rp-yu

Introducing Visual Perception Token into Multimodal Large Language Model

·
3 authors

Submitted by

akhaliq

WebGames: Challenging General-Purpose Web-Browsing AI Agents

·
8 authors

Submitted by

Dominic789654

The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve?

·
7 authors

Submitted by

akhaliq

Prompt-to-Leaderboard

·
7 authors

Submitted by

oceanpty

Finding the Sweet Spot: Preference Data Construction for Scaling Preference Optimization

·
7 authors

Submitted by

xi-j

AAD-LLM: Neural Attention-Driven Auditory Scene Understanding

·
9 authors

Submitted by

jrzhang

MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs

·
4 authors

Submitted by

akhaliq

An Overview of Large Language Models for Statisticians

·
10 authors

Submitted by

twigs

LaTIM: Measuring Latent Token-to-Token Interactions in Mamba Models

·
2 authors

Submitted by

SyedAbdul

Shakti-VLMs: Scalable Vision-Language Models for Enterprise AI

·
3 authors

Submitted by

Kinpz

LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation

·
8 authors

Submitted by

Ksgk-fy

Scaling LLM Pre-training with Vocabulary Curriculum

·
1 authors

Submitted by

ahmedselhady

WiCkeD: A Simple Method to Make Multiple Choice Benchmarks More Challenging

·
3 authors