Example of using local LLMs

#2
by russellsparadox - opened

In the code you use requests.post(API_URL, headers=headers, json=payload, stream=True) to get the streaming. What if I just have GPT-J-6B that I downloaded locally. Is there a way to get streaming?

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment