Text Generation
Transformers
English
llama
text-generation-inference
Inference Endpoints

Commit History

ExLLaMA V2 quant of TinyLlama-1.1B-Chat-v0.3-4.0bpw-h6-exl2
43fab80

LoneStriker commited on