Text Generation
Transformers
Safetensors
llama
sparse
instruct
text-generation-inference
shubhrapandit
Convert model to BFloat16 and shard using SafeTensors
bdd5a31