MotionBooth: Motion-Aware Customized Text-to-Video Generation Paper • 2406.17758 • Published 4 days ago • 15
APIGen: Automated Pipeline for Generating Verifiable and Diverse Function-Calling Datasets Paper • 2406.18518 • Published 3 days ago • 16
Repulsive Score Distillation for Diverse Sampling of Diffusion Models Paper • 2406.16683 • Published 5 days ago • 4
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far? Paper • 2406.16772 • Published 5 days ago • 3
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization Paper • 2406.16008 • Published 6 days ago • 6
IRASim: Learning Interactive Real-Robot Action Simulators Paper • 2406.14540 • Published 9 days ago • 6
ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians Paper • 2406.16815 • Published 5 days ago • 7
How Many Parameters Does it Take to Change a Light Bulb? Evaluating Performance in Self-Play of Conversational Games as a Function of Model Characteristics Paper • 2406.14051 • Published 9 days ago • 9
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models Paper • 2406.16714 • Published 5 days ago • 10
Preference Tuning For Toxicity Mitigation Generalizes Across Languages Paper • 2406.16235 • Published 5 days ago • 12
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs Paper • 2406.15927 • Published 6 days ago • 13
Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models Paper • 2406.15718 • Published 7 days ago • 13
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers Paper • 2406.16747 • Published 5 days ago • 15
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters Paper • 2406.16758 • Published 5 days ago • 16
WARP: On the Benefits of Weight Averaged Rewarded Policies Paper • 2406.16768 • Published 5 days ago • 19
Efficient Continual Pre-training by Mitigating the Stability Gap Paper • 2406.14833 • Published 8 days ago • 18
VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models Paper • 2406.16338 • Published 5 days ago • 22
Evaluating D-MERIT of Partial-annotation on Information Retrieval Paper • 2406.16048 • Published 6 days ago • 33
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published 5 days ago • 45
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published 7 days ago • 39
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation Paper • 2406.16855 • Published 5 days ago • 52
Reward Steering with Evolutionary Heuristics for Decoding-time Alignment Paper • 2406.15193 • Published 8 days ago • 11
Evaluating RAG-Fusion with RAGElo: an Automated Elo-based Framework Paper • 2406.14783 • Published 8 days ago • 14
EvTexture: Event-driven Texture Enhancement for Video Super-Resolution Paper • 2406.13457 • Published 10 days ago • 12
Complexity of Symbolic Representation in Working Memory of Transformer Correlates with the Complexity of a Task Paper • 2406.14213 • Published 9 days ago • 20
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models Paper • 2406.14599 • Published 9 days ago • 16
Towards Retrieval Augmented Generation over Large Video Libraries Paper • 2406.14938 • Published 8 days ago • 18
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs Paper • 2406.15319 • Published 8 days ago • 52
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges Paper • 2406.12624 • Published 11 days ago • 34
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Paper • 2206.04615 • Published Jun 9, 2022 • 5
Can Large Language Models Be an Alternative to Human Evaluations? Paper • 2305.01937 • Published May 3, 2023 • 2
VoCo-LLaMA: Towards Vision Compression with Large Language Models Paper • 2406.12275 • Published 11 days ago • 28
Bootstrapping Language Models with DPO Implicit Rewards Paper • 2406.09760 • Published 15 days ago • 36
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence Paper • 2406.11931 • Published 12 days ago • 54
TroL: Traversal of Layers for Large Language and Vision Models Paper • 2406.12246 • Published 11 days ago • 33
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities Paper • 2406.11768 • Published 12 days ago • 19
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models Paper • 2406.09416 • Published 16 days ago • 28
Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling Paper • 2406.07522 • Published 18 days ago • 34
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs Paper • 2406.10209 • Published 15 days ago • 8
Decoding the Diversity: A Review of the Indic AI Research Landscape Paper • 2406.09559 • Published 15 days ago • 5
MaskLID: Code-Switching Language Identification through Iterative Masking Paper • 2406.06263 • Published 19 days ago • 5
GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors Paper • 2406.10111 • Published 15 days ago • 6
AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis Paper • 2406.08920 • Published 16 days ago • 6
Vivid-ZOO: Multi-View Video Generation with Diffusion Model Paper • 2406.08659 • Published 16 days ago • 7
RVT-2: Learning Precise Manipulation from Few Demonstrations Paper • 2406.08545 • Published 17 days ago • 7
Designing a Dashboard for Transparency and Control of Conversational AI Paper • 2406.07882 • Published 17 days ago • 9
UI Agent Collection A collection of agents for user interfaces/interactions and UI program synthesis • 84 items • Updated 1 day ago • 1
VideoGUI: A Benchmark for GUI Automation from Instructional Videos Paper • 2406.10227 • Published 15 days ago • 8
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality Paper • 2406.08845 • Published 16 days ago • 8