Mayank Mishra

mayank-mishra

AI & ML interests

Large Language Models, Distributed Training and Inference

Recent Activity

upvoted a paper 20 days ago
upvoted a collection 21 days ago
SmolLM2
New activity 29 days ago
ibm-granite/granite-3.0-2b-instruct

Articles

Organizations

mayank-mishra's activity

upvoted an article 3 months ago
view article
Article

Improving Hugging Face Training Efficiency Through Packing with Flash Attention

22
upvoted an article 5 months ago
upvoted an article 7 months ago
view article
Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

78