Ameya Sunil Mahabaleshwarkar

ameyasunilm

AI & ML interests

Deep Learning, NLP, LLM

ameyasunilm's activity

authored a paper 4 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 42

liked a model 5 months ago

nvidia/Mistral-NeMo-Minitron-8B-Instruct

Text Generation • Updated Oct 9, 2024 • 3.99k • 74

New activity in nvidia/Nemotron-Mini-4B-Instruct 6 months ago

Minor issues with the chat template during fine-tuning

#3 opened 6 months ago by

Issue of tool call generation

#2 opened 6 months ago by

upvoted a paper 7 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 58

liked a model about 1 year ago

nvidia/nemotron-3-8b-chat-4k-steerlm

Text Generation • Updated Feb 9, 2024 • 1 • 21