@victor I think the community is eagerly awaiting the next big month-long event, where the community can come together to build something, like we used to do in the past.
Abid Ali Awan
kingabzpro
AI & ML interests
LLMs, MLOps, ASR, & RL
Organizations
kingabzpro's activity
replied to
their
post
15 days ago
posted
an
update
15 days ago
posted
an
update
18 days ago
Post
1089
I never imagined that Jenkins could be as powerful and easy to implement as GitHub Actions. Loving it. ๐ฅฐ
replied to
their
post
18 days ago
I'm having some issues with the RAG pipeline. It generally takes 0.2-2 seconds for it to respond, and most of the time the embedding model takes even longer. I can implement prompt caching, but I was considering a more hardware-related solution. What do you think about using Ray for distributed serving? Also, what do you think about GraphQL?
posted
an
update
21 days ago
Post
1824
How can I make my RAG application generate real-time responses? Up until now, I have been using Groq for fast LLM generation and the Gradio Live function. I am looking for a better solution that can help me build a real-time application without any delay.
@abidlabs
kingabzpro/Real-Time-RAG
kingabzpro/Real-Time-RAG