Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models Paper • 2411.07232 • Published 15 days ago • 60
M-Longdoc: A Benchmark For Multimodal Super-Long Document Understanding And A Retrieval-Aware Tuning Framework Paper • 2411.06176 • Published 17 days ago • 44
Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published 15 days ago • 30
Large Language Models Can Self-Improve in Long-context Reasoning Paper • 2411.08147 • Published 14 days ago • 59
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published 11 days ago • 99
Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding Paper • 2411.04282 • Published 20 days ago • 30
LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation Paper • 2411.04997 • Published 19 days ago • 35
RetrieveGPT: Merging Prompts and Mathematical Models for Enhanced Code-Mixed Information Retrieval Paper • 2411.04752 • Published 19 days ago • 16
Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model Paper • 2411.04496 • Published 19 days ago • 22
Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks? Paper • 2411.05000 • Published 19 days ago • 21
M3DocRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding Paper • 2411.04952 • Published 19 days ago • 27
OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models Paper • 2411.04905 • Published 19 days ago • 109
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level Paper • 2411.03562 • Published 20 days ago • 60
LLaMo: Large Language Model-based Molecular Graph Assistant Paper • 2411.00871 • Published 26 days ago • 21
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems Paper • 2411.02959 • Published 21 days ago • 64
Language Models can Self-Lengthen to Generate Long Texts Paper • 2410.23933 • Published 26 days ago • 16
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published 26 days ago • 59
CLEAR: Character Unlearning in Textual and Visual Modalities Paper • 2410.18057 • Published Oct 23 • 200