opt-30b-deepspeed-inference-fp16-shard-2 / ds_inference_config.json

Commit History

added tp sharded ckpts
8c8b767

lucadiliello commited on