arxiv:2411.13676
Ameya Sunil Mahabaleshwarkar
ameyasunilm
AI & ML interests
Deep Learning, NLP, LLM
Recent Activity
authored
a paper
3 days ago
Hymba: A Hybrid-head Architecture for Small Language Models
liked
a model
about 2 months ago
nvidia/Mistral-NeMo-Minitron-8B-Instruct
Organizations
Papers
1
models
None public yet
datasets
None public yet