-
Levels of AGI: Operationalizing Progress on the Path to AGI
Paper • 2311.02462 • Published • 32 -
Ultra-Long Sequence Distributed Transformer
Paper • 2311.02382 • Published • 2 -
A Survey on Language Models for Code
Paper • 2311.07989 • Published • 21 -
GRIM: GRaph-based Interactive narrative visualization for gaMes
Paper • 2311.09213 • Published • 12
Collections
Discover the best community collections!
Collections including paper arxiv:2311.02462
-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 96 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 74 -
In-Context Learning Creates Task Vectors
Paper • 2310.15916 • Published • 41 -
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 40
-
Language Modeling Is Compression
Paper • 2309.10668 • Published • 82 -
Small-scale proxies for large-scale Transformer training instabilities
Paper • 2309.14322 • Published • 19 -
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval
Paper • 2309.15129 • Published • 6 -
Vision Transformers Need Registers
Paper • 2309.16588 • Published • 77
-
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 75 -
Natural Language Supervision for General-Purpose Audio Representations
Paper • 2309.05767 • Published • 9 -
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 52 -
AudioSR: Versatile Audio Super-resolution at Scale
Paper • 2309.07314 • Published • 24