This is a Hugging Face transformers-style conversion of the original 15B-parameter SMoE model (BFLOAT16 weights) from the paper "Efficient Large Scale Language Modeling with Mixtures of Experts" by Artetxe et al. The original model card can be found at https://github.com/facebookresearch/fairseq/blob/main/examples/moe_lm/model_card.md.
The usage example and modeling code can be found at https://github.com/pingzhili/light-fairseq
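As a rough illustration, a transformers-style checkpoint like this can usually be loaded as sketched below. This is a minimal sketch, not the repository's official example: the repo id is hypothetical, and whether `trust_remote_code` is required depends on how the custom MoE modeling code is packaged, so refer to the light-fairseq repository above for the authoritative usage.

```python
# Minimal usage sketch (assumptions: repo id, trust_remote_code, generation settings).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "pingzhili/smoe-15b"  # hypothetical repo id; replace with the actual checkpoint

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,   # the checkpoint is stored in BFLOAT16
    trust_remote_code=True,       # may be needed to load the custom MoE modeling code
)

inputs = tokenizer("Mixture-of-experts models scale by", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```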