Daily Papers

by AK and the research community

Jun 25

Submitted by

yuangpeng

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

·
10 authors

Submitted by

ellisbrown

Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

·
14 authors

Submitted by

terryyz

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

·
33 authors

Submitted by

Royir

Evaluating D-MERIT of Partial-annotation on Information Retrieval

·
7 authors

Submitted by

PY007

Long Context Transfer from Language to Vision

·
10 authors

Submitted by

adamdad

Video-Infinity: Distributed Long Video Generation

·
4 authors

Submitted by

zlzheng

VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models

·
5 authors

Submitted by

Ryan1122

Scaling Laws for Linear Complexity Language Models

·
6 authors

Submitted by

alexrame

WARP: On the Benefits of Weight Averaged Rewarded Policies

·
10 authors

Submitted by

YiDuo1999

Efficient Continual Pre-training by Mitigating the Stability Gap

·
5 authors

Submitted by

Kthyeon

Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters

·
5 authors

Submitted by

zlzheng

Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers

·
4 authors

Submitted by

ShengdingHu

Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models

·
9 authors

Submitted by

jlko

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs

·
6 authors

Submitted by

yongzx

Preference Tuning For Toxicity Mitigation Generalizes Across Languages

·
3 authors

Submitted by

gsarti

Confidence Regulation Neurons in Language Models

·
7 authors

Submitted by

CCCCCC

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

·
9 authors

Submitted by

sherzod-hakimov

How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics

·
4 authors

Submitted by

tangjs

ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians

·
7 authors

Submitted by

cydhsieh01

Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization

·
11 authors

Submitted by

fangqi

IRASim: Learning Interactive Real-Robot Action Simulators

·
6 authors

Submitted by

cattana

Can Few-shot Work in Long-Context? Recycling the Context to Generate Demonstrations

·
11 authors

Submitted by

BrianatCambridge

video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models

·
10 authors

Submitted by

nicozilber

Repulsive Score Distillation for Diverse Sampling of Diffusion Models

·
3 authors

Submitted by

SinclairWang

OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?

·
4 authors