Running
on
Zero
7
๐ฆ
Llama Cpp Agent
Chat: llama cpp agent
A retrieval system with chatbot integration
View how beam search decoding works, in detail!
Efficient quantized retrieval over Wikipedia
text streaming space using Gemma-7B