view article Article From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate Jun 13, 2024 • 45
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 134