Code Evaluation Collection Collection of Papers on Code Evaluation (from code generation language models) • 45 items • Updated Oct 29, 2024 • 15
CodeXGLUE: A Machine Learning Benchmark Dataset for Code Understanding and Generation Paper • 2102.04664 • Published Feb 9, 2021 • 2
Classical Sorting Algorithms as a Model of Morphogenesis: self-sorting arrays reveal unexpected competencies in a minimal model of basal intelligence Paper • 2401.05375 • Published Dec 15, 2023 • 1
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models Paper • 2410.20771 • Published Oct 28, 2024 • 3
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 22 days ago • 80
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations Paper • 2006.11477 • Published Jun 20, 2020 • 5
No More Adam: Learning Rate Scaling at Initialization is All You Need Paper • 2412.11768 • Published 18 days ago • 41
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 16 days ago • 116
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated 15 days ago • 111
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper • 2412.10360 • Published 21 days ago • 136
CANINE: Pre-training an Efficient Tokenization-Free Encoder for Language Representation Paper • 2103.06874 • Published Mar 11, 2021 • 1
ByT5: Towards a token-free future with pre-trained byte-to-byte models Paper • 2105.13626 • Published May 28, 2021 • 3
W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training Paper • 2108.06209 • Published Aug 7, 2021 • 1
StarCraft II: A New Challenge for Reinforcement Learning Paper • 1708.04782 • Published Aug 16, 2017 • 1