Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
nm-testing
/
SparseLlama-3.1-8B-gsm8k-pruned.2of4-tensor_wts_per_tok_dyn_act_int8-BitM
like
0
Follow
NM Testing
38
Safetensors
llama
8-bit precision
compressed-tensors
Model card
Files
Files and versions
Community
Train
e1275a6
SparseLlama-3.1-8B-gsm8k-pruned.2of4-tensor_wts_per_tok_dyn_act_int8-BitM
1 contributor
History:
2 commits
RelaxingSnorlax
Upload folder using huggingface_hub
e1275a6
verified
13 days ago
.gitattributes
Safe
1.57 kB
Upload folder using huggingface_hub
13 days ago
config.json
Safe
2.26 kB
Upload folder using huggingface_hub
13 days ago
generation_config.json
Safe
180 Bytes
Upload folder using huggingface_hub
13 days ago
model-00001-of-00002.safetensors
Safe
5 GB
LFS
Upload folder using huggingface_hub
13 days ago
model-00002-of-00002.safetensors
Safe
1.47 GB
LFS
Upload folder using huggingface_hub
13 days ago
model.safetensors.index.json
Safe
80.7 kB
Upload folder using huggingface_hub
13 days ago
recipe.yaml
Safe
596 Bytes
Upload folder using huggingface_hub
13 days ago
special_tokens_map.json
Safe
449 Bytes
Upload folder using huggingface_hub
13 days ago
tokenizer.json
Safe
17.2 MB
LFS
Upload folder using huggingface_hub
13 days ago
tokenizer_config.json
Safe
50.5 kB
Upload folder using huggingface_hub
13 days ago