Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Nethermind
/
Mpt-Instruct-DotNet-XS
like
0
Text Generation
Transformers
PyTorch
English
mosaic_gpt
csharp
mpt
instruct
1b
llm
.net
custom_code
License:
cc-by-sa-3.0
Model card
Files
Files and versions
Community
Train
Use this model
main
Mpt-Instruct-DotNet-XS
1 contributor
History:
5 commits
Kabumbus
GGML models that can run f16 41.68 ms per token and q8 23.76 ms per token giving good results
56d7c99
10 months ago
.gitattributes
1.52 kB
initial commit
10 months ago
README.md
3.78 kB
Usage example
10 months ago
attention.py
13.8 kB
Trained model
10 months ago
config.json
1.08 kB
Trained model
10 months ago
configuration_mosaic_gpt.py
8.87 kB
Trained model
10 months ago
generation_config.json
91 Bytes
Trained model
10 months ago
ggml-model-f16.bin
2.62 GB
LFS
GGML models that can run f16 41.68 ms per token and q8 23.76 ms per token giving good results
10 months ago
ggml-model-q8_0.bin
1.39 GB
LFS
GGML models that can run f16 41.68 ms per token and q8 23.76 ms per token giving good results
10 months ago
gpt_blocks.py
3.11 kB
Trained model
10 months ago
low_precision_layernorm.py
1.27 kB
Trained model
10 months ago
mosaic_gpt.py
19.5 kB
Trained model
10 months ago
param_init_fns.py
15.9 kB
Trained model
10 months ago
pytorch_model.bin
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.BFloat16Storage"
What is a pickle import?
2.62 GB
LFS
Trained model
10 months ago
special_tokens_map.json
131 Bytes
Trained model
10 months ago
tokenizer.json
2.11 MB
Trained model
10 months ago
tokenizer_config.json
264 Bytes
Trained model
10 months ago