DialSim: A Real-Time Simulator for Evaluating Long-Term Dialogue Understanding of Conversational Agents Paper • 2406.13144 • Published 8 days ago • 3
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation Paper • 2406.16678 • Published 2 days ago • 6
MotionBooth: Motion-Aware Customized Text-to-Video Generation Paper • 2406.17758 • Published 1 day ago • 10
On the Transformations across Reward Model, Parameter Update, and In-Context Prompt Paper • 2406.16377 • Published 3 days ago • 6
Aligning Diffusion Models with Noise-Conditioned Perception Paper • 2406.17636 • Published 1 day ago • 17
DiffusionPDE: Generative PDE-Solving Under Partial Observation Paper • 2406.17763 • Published 1 day ago • 17
LongIns: A Challenging Long-context Instruction-based Exam for LLMs Paper • 2406.17588 • Published 1 day ago • 14
Unlocking Continual Learning Abilities in Language Models Paper • 2406.17245 • Published 1 day ago • 11
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning Paper • 2406.17770 • Published 1 day ago • 11
FreeTraj: Tuning-Free Trajectory Control in Video Diffusion Models Paper • 2406.16863 • Published 2 days ago • 6
YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals Paper • 2406.16273 • Published 3 days ago • 8
Can Few-shot Work in Long-Context? Recycling the Context to Generate Demonstrations Paper • 2406.13632 • Published 7 days ago • 4
Repulsive Score Distillation for Diverse Sampling of Diffusion Models Paper • 2406.16683 • Published 2 days ago • 2
Found in the Middle: Calibrating Positional Attention Bias Improves Long Context Utilization Paper • 2406.16008 • Published 4 days ago • 3
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models Paper • 2406.15704 • Published 5 days ago • 4
OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far? Paper • 2406.16772 • Published 2 days ago • 3
ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians Paper • 2406.16815 • Published 2 days ago • 5
AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models Paper • 2406.16714 • Published 2 days ago • 8
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers Paper • 2406.16747 • Published 2 days ago • 12
Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs Paper • 2406.16860 • Published 2 days ago • 32
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published 4 days ago • 36
WARP: On the Benefits of Weight Averaged Rewarded Policies Paper • 2406.16768 • Published 2 days ago • 15
Preference Tuning For Toxicity Mitigation Generalizes Across Languages Paper • 2406.16235 • Published 3 days ago • 10
Efficient Continual Pre-training by Mitigating the Stability Gap Paper • 2406.14833 • Published 6 days ago • 16
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters Paper • 2406.16758 • Published 2 days ago • 15
VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models Paper • 2406.16338 • Published 3 days ago • 21
DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation Paper • 2406.16855 • Published 2 days ago • 49
Cognitive Map for Language Models: Optimal Planning via Verbally Representing the World Model Paper • 2406.15275 • Published 5 days ago • 9
Towards Retrieval Augmented Generation over Large Video Libraries Paper • 2406.14938 • Published 6 days ago • 17
EvTexture: Event-driven Texture Enhancement for Video Super-Resolution Paper • 2406.13457 • Published 7 days ago • 12
Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges Paper • 2406.12624 • Published 8 days ago • 31
Multimodal Structured Generation: CVPR's 2nd MMFM Challenge Technical Report Paper • 2406.11403 • Published 9 days ago • 4
Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models Paper • 2406.14035 • Published 7 days ago • 9
How Well Do LLMs Represent Values Across Cultures? Empirical Analysis of LLM Responses Based on Hofstede Cultural Dimensions Paper • 2406.14805 • Published 6 days ago • 3
Stylebreeder: Exploring and Democratizing Artistic Styles through Text-to-Image Models Paper • 2406.14599 • Published 6 days ago • 14
MantisScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation Paper • 2406.15252 • Published 5 days ago • 12
LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs Paper • 2406.15319 • Published 5 days ago • 46
A Systematic Survey of Text Summarization: From Statistical Methods to Large Language Models Paper • 2406.11289 • Published 10 days ago • 4
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP Paper • 2406.12618 • Published 8 days ago • 4
StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images Paper • 2406.13735 • Published 7 days ago • 5
Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation Paper • 2406.13663 • Published 7 days ago • 7
Model Merging and Safety Alignment: One Bad Model Spoils the Bunch Paper • 2406.14563 • Published 6 days ago • 26
∇²DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials Paper • 2406.14347 • Published 6 days ago • 93
The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing Paper • 2406.10601 • Published 11 days ago • 63
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Paper • 2406.13542 • Published 7 days ago • 14
Improving Visual Commonsense in Language Models via Multiple Image Generation Paper • 2406.13621 • Published 7 days ago • 13
HARE: HumAn pRiors, a key to small language model Efficiency Paper • 2406.11410 • Published 9 days ago • 35
LiveMind: Low-latency Large Language Models with Simultaneous Inference Paper • 2406.14319 • Published 6 days ago • 13
Invertible Consistency Distillation for Text-Guided Image Editing in Around 7 Steps Paper • 2406.14539 • Published 6 days ago • 24
τ-bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains Paper • 2406.12045 • Published 9 days ago • 4
ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning Paper • 2406.14130 • Published 6 days ago • 10
Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level Paper • 2406.11817 • Published 9 days ago • 13
REPOEXEC: Evaluate Code Generation with a Repository-Level Executable Benchmark Paper • 2406.11927 • Published 9 days ago • 7
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published 6 days ago • 64
Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities Paper • 2406.14562 • Published 6 days ago • 25
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents Paper • 2406.13923 • Published 7 days ago • 20