Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published 7 days ago • 103 • 8
The Case for Co-Designing Model Architectures with Hardware Paper • 2401.14489 • Published Jan 25 • 3 • 1
Meltemi: The first open Large Language Model for Greek Paper • 2407.20743 • Published Jul 30 • 67 • 4
Efficient Guided Generation for Large Language Models Paper • 2307.09702 • Published Jul 19, 2023 • 8 • 1