switch-c-2048_qmoe
This is the google/switch-c-2048 model quantized with the QMoE framework to ternary precision and stored in the custom further compressed QMoE format.
Please see the QMoE repository for how to use this model.
- Downloads last month
- 7
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no pipeline_tag.