Sourab Mangrulkar

smangrul

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing, Natural Language Generation, Computer Vision, Reinforcement Learning

Recent Activity

updated a Space about 1 month ago
smangrul/PEFT-Docs-QA-Chatbot

Articles

Organizations

Speech Recognition Community Event Version 2, BigScience Data, group2, BigCode, Diffusers Pipelines Library for Stable Diffusion, Social Post Explorers

smangrul's activity

posted an update 8 months ago
Post
Unlocking the Power of Locally Running Llama-3 8B Model Agents with Chat-UI! 🔥🚀✨

I'm thrilled to share my hackathon-style side project:
1. Fine-tuning Llama-3 8B for function calling using PEFT QLoRA, since the instruct Llama-3 model doesn't support it. The Colab notebook is here: https://lnkd.in/ggJMzqh2. 🛠️
2. The fine-tuned model, along with the 4-bit quants, is here: https://lnkd.in/gNpFKY6V ✨
3. Cloning Hugging Face https://lnkd.in/gKBKuUBQ and making it compatible with function calling by building upon the PR https://lnkd.in/gnqFuAd4, for my model and a local inference use case using Ollama. This was a steep learning curve, and I stayed awake the whole night to get it working. 💪🏽
4. For the above, I used SerpAPI for web browsing and the MongoDB Atlas free tier for persisting conversations and assistant configs. 🔎
5. More work is required on switching between using tools and responding directly, which is where I see the model break. 🧐

How cool is it that we are approaching an experience akin to ChatGPT with a locally hosted agent model running on your laptop! 💻
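
To illustrate the open problem in point 5 (switching between calling a tool and answering directly), here is a minimal Python sketch. It is not the project's code: the JSON tool-call shape (`name` plus `arguments`), the `route` helper, and the stubbed `fake_web_search` tool are all assumptions made for illustration. The idea is simply that output which parses as a known tool call gets dispatched, and anything else is treated as a direct reply.

```python
import json
from typing import Callable, Dict

# Hypothetical stand-in for a real search tool (the post used SerpAPI).
def fake_web_search(query: str) -> str:
    return f"results for: {query}"

# Hypothetical registry mapping tool names to callables.
TOOLS: Dict[str, Callable[..., str]] = {"web_search": fake_web_search}

def route(model_output: str) -> str:
    """Dispatch a JSON tool call if one is present, else return the text unchanged."""
    try:
        call = json.loads(model_output)
    except json.JSONDecodeError:
        return model_output  # not JSON: treat as a direct answer

    # Assumed call shape: {"name": <tool name>, "arguments": {...}}
    if not isinstance(call, dict):
        return model_output
    name = call.get("name")
    args = call.get("arguments", {})
    if not isinstance(name, str) or name not in TOOLS or not isinstance(args, dict):
        return model_output  # unknown or malformed call: fall back to plain text

    return TOOLS[name](**args)

if __name__ == "__main__":
    print(route("Hello! How can I help?"))
    print(route('{"name": "web_search", "arguments": {"query": "llama 3"}}'))
```

Falling back to the raw text on any malformed call is one possible design choice; a real agent loop might instead feed the error back to the model and ask it to retry.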
upvoted an article 8 months ago
Article

Welcome Llama 3 - Meta's new open LLM
