Daily Papers

by AK and the research community

Nov 25

Submitted by

akhaliq

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

·
23 authors

Submitted by

adamdad

OminiControl: Minimal and Universal Control for Diffusion Transformer

·
5 authors

Submitted by

xanderhuang

Material Anything: Generating Materials for Any 3D Object via Diffusion

·
4 authors

Submitted by

chaehun

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

·
4 authors

Submitted by

gabrielchua

A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

·
3 authors

Submitted by

akhaliq

MyTimeMachine: Personalized Facial Age Transformation

·
6 authors

Submitted by

pagli98

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

·
13 authors

Submitted by

kcz358

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

·
4 authors

Submitted by

JackyZhuo

VideoEspresso: A Large-Scale Chain-of-Thought Dataset for Fine-Grained Video Reasoning via Core Frame Selection

·
10 authors

Submitted by

younggyoseo

Efficient Long Video Tokenization via Coordinate-based Patch Reconstruction

·
5 authors

Submitted by

KunhaoLiu

Novel View Extrapolation with Video Diffusion Priors

·
3 authors

Submitted by

j-min

VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement

·
4 authors

Submitted by

dnoever

The Impossible Test: A 2024 Unsolvable Dataset and A Chance for an AGI Quiz

·
2 authors

Submitted by

akhaliq

WildLMa: Long Horizon Loco-Manipulation in the Wild

·
11 authors

Submitted by

JusperLee

Adapting Vision Foundation Models for Robust Cloud Segmentation in Remote Sensing Images

·
8 authors

Submitted by

colo286

One to rule them all: natural language to bind communication, perception and action

·
3 authors