Abid Ali Awan

kingabzpro

AI & ML interests

LLMs, MLOps, ASR, & RL

Organizations

kingabzpro's activity

replied to their post 15 days ago

@victor I think the community is eagerly awaiting the next big month-long event, where everyone can come together to build something, as we used to do in the past.

posted an update 15 days ago
I believe Hugging Face should have something similar to Hacktoberfest. I miss the days when there were events like this every 3 months for audio, deep reinforcement learning, and Gradio themes, but everything seems to have slowed down. There are no more Hugging Face events.
@victor
  • 3 replies
posted an update 18 days ago
I never imagined that Jenkins could be as powerful and easy to implement as GitHub Actions. Loving it. 🥰
replied to their post 18 days ago

I'm having some issues with the RAG pipeline. It generally takes 0.2-2 seconds for it to respond, and most of the time the embedding model takes even longer. I can implement prompt caching, but I was considering a more hardware-related solution. What do you think about using Ray for distributed serving? Also, what do you think about GraphQL?

posted an update 21 days ago
How can I make my RAG application generate real-time responses? Up until now, I have been using Groq for fast LLM generation and the Gradio Live function. I am looking for a better solution that can help me build a real-time application without any delay. @abidlabs

kingabzpro/Real-Time-RAG
  • 2 replies