Ameya Sunil Mahabaleshwarkar's picture

3 1 2

Ameya Sunil Mahabaleshwarkar

ameyasunilm

AI & ML interests

Deep Learning, NLP, LLM

Recent Activity

authored a paper 3 days ago

Hymba: A Hybrid-head Architecture for Small Language Models

liked a model about 2 months ago

nvidia/Mistral-NeMo-Minitron-8B-Instruct

View all activity

Organizations

ameyasunilm's activity

authored a paper 3 days ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published 5 days ago • 33

liked a model about 2 months ago

nvidia/Mistral-NeMo-Minitron-8B-Instruct

Text Generation • Updated Oct 9 • 3.38k • 64

New activity in nvidia/Nemotron-Mini-4B-Instruct 2 months ago

Minor issues with the chat template during fine-tuning

#3 opened 2 months ago by

Issue of tool call generation

#2 opened 2 months ago by

upvoted a paper 3 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21 • 55

liked a model 10 months ago

nvidia/nemotron-3-8b-chat-4k-steerlm

Text Generation • Updated Feb 9 • 21