Text Generation
Transformers
GGUF
PyTorch
Safetensors
mistral
quantized
2-bit
3-bit
4-bit precision
5-bit
6-bit
8-bit precision
GGUF
llama
en
dataset:HuggingFaceH4/ultrafeedback_binarized
dataset:allenai/tulu-v2-sft-mixture
arxiv:2305.18290
arxiv:2311.10702
Inference Endpoints
has_space
text-generation-inference
File size: 136 Bytes