Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning Paper • 2503.04973 • Published 21 days ago • 21
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published 30 days ago • 26
Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering Paper • 2502.13962 • Published Feb 19 • 28
Compressed Chain of Thought: Efficient Reasoning Through Dense Representations Paper • 2412.13171 • Published Dec 17, 2024 • 34
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models Paper • 2409.11136 • Published Sep 17, 2024 • 23
The Consensus Game: Language Model Generation via Equilibrium Search Paper • 2310.09139 • Published Oct 13, 2023 • 14
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences Paper • 2306.07906 • Published Jun 13, 2023 • 13