TinyLlama-1.1B-Chat-v0.3-8.0bpw-h8-exl2 / generation_config.json
ExLLaMA V2 quant of TinyLlama-1.1B-Chat-v0.3-8.0bpw-h8-exl2
29901b1
68 Bytes
{
  "max_new_tokens": 32,
  "transformers_version": "4.34.0.dev0"
}
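
The config above sets a single generation parameter, `max_new_tokens`, which in Transformers caps how many tokens are generated beyond the prompt. The other key, `transformers_version`, records the library version that wrote the file. As a minimal sketch of how such a file could be read, the snippet below parses the same JSON with Python's standard library only. The helper name `load_generation_settings` is hypothetical and not part of any library, and the JSON text is inlined here rather than read from disk.

```python
import json

# Contents of generation_config.json, inlined for illustration.
CONFIG_TEXT = """
{
  "max_new_tokens": 32,
  "transformers_version": "4.34.0.dev0"
}
"""


def load_generation_settings(text: str) -> dict:
    """Parse the config text and keep only the generation parameters.

    Hypothetical helper: drops the "transformers_version" key, which is
    bookkeeping metadata rather than a generation setting.
    """
    config = json.loads(text)
    return {k: v for k, v in config.items() if k != "transformers_version"}


settings = load_generation_settings(CONFIG_TEXT)
print(settings)  # prints {'max_new_tokens': 32}
```

With this file, generation would stop after at most 32 new tokens unless the caller overrides the value at call time.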