Re3: Generating Longer Stories With Recursive Reprompting and Revision Paper • 2210.06774 • Published Oct 13, 2022 • 2
AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls Paper • 2402.04253 • Published Feb 6, 2024
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate Paper • 2305.19118 • Published May 30, 2023
CAMEL: Communicative Agents for "Mind" Exploration of Large Scale Language Model Society Paper • 2303.17760 • Published Mar 31, 2023 • 1
Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization Paper • 2310.02170 • Published Oct 3, 2023 • 2
MetaGPT: Meta Programming for Multi-Agent Collaborative Framework Paper • 2308.00352 • Published Aug 1, 2023 • 2
Generative Agents: Interactive Simulacra of Human Behavior Paper • 2304.03442 • Published Apr 7, 2023 • 12
Voyager: An Open-Ended Embodied Agent with Large Language Models Paper • 2305.16291 • Published May 25, 2023 • 9
Large Language Models for Autonomous Driving: Real-World Experiments Paper • 2312.09397 • Published Dec 14, 2023
The Rise and Potential of Large Language Model Based Agents: A Survey Paper • 2309.07864 • Published Sep 14, 2023 • 7
Efficient Streaming Language Models with Attention Sinks Paper • 2309.17453 • Published Sep 29, 2023 • 13
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention Paper • 2404.07143 • Published Apr 10, 2024 • 104
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation Paper • 2108.12409 • Published Aug 27, 2021 • 5
Extending Context Window of Large Language Models via Positional Interpolation Paper • 2306.15595 • Published Jun 27, 2023 • 53
RoFormer: Enhanced Transformer with Rotary Position Embedding Paper • 2104.09864 • Published Apr 20, 2021 • 11
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 138
Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning Paper • 2305.14160 • Published May 23, 2023 • 1
Studying Large Language Model Generalization with Influence Functions Paper • 2308.03296 • Published Aug 7, 2023 • 12
A Comprehensive Study of Knowledge Editing for Large Language Models Paper • 2401.01286 • Published Jan 2, 2024 • 16
Can Large Language Models Explain Themselves? A Study of LLM-Generated Self-Explanations Paper • 2310.11207 • Published Oct 17, 2023
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback Paper • 2305.14975 • Published May 24, 2023 • 1
Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs Paper • 2306.13063 • Published Jun 22, 2023
Language Models Don't Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting Paper • 2305.04388 • Published May 7, 2023 • 1
On Large Language Models' Selection Bias in Multi-Choice Questions Paper • 2309.03882 • Published Sep 7, 2023
Fast Inference from Transformers via Speculative Decoding Paper • 2211.17192 • Published Nov 30, 2022 • 4
Accelerating Large Language Model Decoding with Speculative Sampling Paper • 2302.01318 • Published Feb 2, 2023 • 2
Inference with Reference: Lossless Acceleration of Large Language Models Paper • 2304.04487 • Published Apr 10, 2023
Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations Paper • 2402.12786 • Published Feb 20, 2024
Paralinguistics-Enhanced Large Language Modeling of Spoken Dialogue Paper • 2312.15316 • Published Dec 23, 2023
Towards General-Purpose Text-Instruction-Guided Voice Conversion Paper • 2309.14324 • Published Sep 25, 2023
PromptTTS: Controllable Text-to-Speech with Text Descriptions Paper • 2211.12171 • Published Nov 22, 2022
BASE TTS: Lessons from building a billion-parameter Text-to-Speech model on 100K hours of data Paper • 2402.08093 • Published Feb 12, 2024 • 57
Toward Joint Language Modeling for Speech Units and Text Paper • 2310.08715 • Published Oct 12, 2023 • 7
Can Large Language Models Be an Alternative to Human Evaluations? Paper • 2305.01937 • Published May 3, 2023 • 2
A Closer Look into Automatic Evaluation Using Large Language Models Paper • 2310.05657 • Published Oct 9, 2023
Judging LLM-as-a-judge with MT-Bench and Chatbot Arena Paper • 2306.05685 • Published Jun 9, 2023 • 31
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators Paper • 2404.04475 • Published Apr 6, 2024
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models Paper • 2206.04615 • Published Jun 9, 2022 • 5
Do the Rewards Justify the Means? Measuring Trade-Offs Between Rewards and Ethical Behavior in the MACHIAVELLI Benchmark Paper • 2304.03279 • Published Apr 6, 2023 • 1
Sparks of Artificial General Intelligence: Early experiments with GPT-4 Paper • 2303.12712 • Published Mar 22, 2023 • 2
Rethinking Benchmark and Contamination for Language Models with Rephrased Samples Paper • 2311.04850 • Published Nov 8, 2023