nm-testing/SparseLlama-3.1-8B-gsm8k-pruned.2of4-tensor_wts_per_tok_dyn_act_int8-BitM

8-bit precision

compressed-tensors



1 contributor

History: 2 commits


Latest commit: e1275a6 (verified) by RelaxingSnorlax, about 2 months ago: Upload folder using huggingface_hub

File                              Size       Storage  Last commit
.gitattributes                    1.57 kB             Upload folder using huggingface_hub (about 2 months ago)
config.json                       2.26 kB             Upload folder using huggingface_hub (about 2 months ago)
generation_config.json            180 Bytes           Upload folder using huggingface_hub (about 2 months ago)
model-00001-of-00002.safetensors  5 GB       LFS      Upload folder using huggingface_hub (about 2 months ago)
model-00002-of-00002.safetensors  1.47 GB    LFS      Upload folder using huggingface_hub (about 2 months ago)
model.safetensors.index.json      80.7 kB             Upload folder using huggingface_hub (about 2 months ago)
recipe.yaml                       596 Bytes           Upload folder using huggingface_hub (about 2 months ago)
special_tokens_map.json           449 Bytes           Upload folder using huggingface_hub (about 2 months ago)
tokenizer.json                    17.2 MB    LFS      Upload folder using huggingface_hub (about 2 months ago)
tokenizer_config.json             50.5 kB             Upload folder using huggingface_hub (about 2 months ago)