Text Generation
Transformers
Safetensors
English
llama
Not-For-All-Audiences
conversational
text-generation-inference
Inference Endpoints

Linked imatrix GGUF quants are based on older llama.cpp without rope fix

#3
by bartowski - opened

Not-For-All-Audiences

This repository has been marked as containing sensitive content and may contain potentially harmful and sensitive information.

View content