Avinash Sooriyarachchi's picture

2 1 83

Avinash Sooriyarachchi

AviSoori1x

·

https://www.linkedin.com/in/avi-data-ml/

AI & ML interests

I work at Mistral AI

Recent Activity

liked a Space about 1 month ago

Qwen/Qwen2.5-Coder-Artifacts

View all activity

Articles

seemore: Implement a Vision Language Model from Scratch

SeeMoE: Implementing a MoE Vision Language Model from Scratch

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Sparse Mixture of Experts Language Model from Scratch: Extending makeMoE with Expert Capacity

Organizations

AviSoori1x's activity

upvoted a paper 10 months ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6 • 183