16 6 190

Sourab Mangrulkar

smangrul

https://www.linkedin.com/in/sourab-m/

pacman100

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing, Natural Language Generation, Computer Vision, Reinforcement Learning

Recent Activity

liked a Space 7 days ago

huggingface/ai-deadlines

liked a model about 2 months ago

mistralai/Mistral-7B-Instruct-v0.3

liked a model about 2 months ago

mistralai/Mamba-Codestral-7B-v0.1

View all activity

Organizations

smangrul's activity

liked a Space 7 days ago

303

AI Deadlines

⚡

Generate project deadlines

liked 2 models about 2 months ago

mistralai/Mistral-7B-Instruct-v0.3

Text Generation • Updated Aug 21, 2024 • 709k • • 1.45k

mistralai/Mamba-Codestral-7B-v0.1

Updated Aug 23, 2024 • 3.26k • 578

liked a model 3 months ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • Updated Dec 4, 2024 • 1.06M • • 1.36k

updated a Space 4 months ago

PEFT Docs QA Chatbot

📚

Ask questions about PEFT docs and get answers

liked 3 models 6 months ago

liked a model 7 months ago

microsoft/Phi-3-mini-128k-instruct

Text Generation • Updated 3 days ago • 128k • • 1.64k

liked a model 8 months ago

microsoft/Phi-3-mini-4k-instruct

Text Generation • Updated Sep 20, 2024 • 913k • • 1.14k

upvoted a collection 8 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 563

liked a dataset 10 months ago

internlm/Agent-FLAN

Preview • Updated Mar 20, 2024 • 128 • 72

liked a Space 10 months ago

Nexus Function Calling Leaderboard

🐠

Visualize model performance on function calling tasks

liked a model 10 months ago

gorilla-llm/gorilla-openfunctions-v2

Text Generation • Updated Apr 18, 2024 • 620 • 224

liked a dataset 11 months ago

smangrul/hug_stack

Viewer • Updated Feb 2, 2024 • 6.58k • 162 • 3

updated a model 11 months ago

smangrul/peft-lora-whisper-largev2-cv11-mr-t4-colab

Updated Apr 23, 2024 • 21 • 1

liked a model 11 months ago

smangrul/falcon-180B-chat-asst-ds-lora

Updated Oct 2, 2023 • 3 • 1

posted an update 11 months ago

Post

3542

Unlocking the Power of locally running Llama-3 8B Model Agents with Chat-UI! 🔥🚀✨

I'm thrilled to share my hackathon-style side project:
1. Finetuning Llama-8B for function calling using PEFT QLoRA as the instruct Llama-3 model doesn't support this. The colab notebook for it is here: https://lnkd.in/ggJMzqh2. 🛠️
2. Finetuned model along with the 4-bit quants here: https://lnkd.in/gNpFKY6V ✨
3. Clone Hugging Face https://lnkd.in/gKBKuUBQ and make it compatible for function calling by building upon the PR https://lnkd.in/gnqFuAd4 for my model and local inferencing usecase using Ollama. This was a steep learning curve wherein I stayed awake the whole night to get it working. 💪🏽
4. Above, I used SerpAPI for web browsing and Mongo DB Atlas free tier for persistence of conversations and assistant configs. 🔎
5. More work is required to switch between using tools and responding directly wherein I see the model breaks. 🧐

How cool is this wherein we are approaching experience akin to ChatGPT while using local hosted agent model running on your laptop! 💻

1 reply

updated a model 11 months ago

smangrul/llama-3-8B-instruct-function-calling

Updated Apr 21, 2024 • 92 • 29

liked a model 11 months ago

smangrul/llama-3-8B-instruct-function-calling

Updated Apr 21, 2024 • 92 • 29