Orca: Progressive Learning from Complex Explanation Traces of GPT-4 Paper • 2306.02707 • Published Jun 5, 2023 • 46
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published 11 days ago • 73