Space: The Ultra-Scale Playbook 🌌, the ultimate guide to training LLMs on large GPU clusters • 2.25k
Article: Mastering Long Contexts in LLMs with KVPress • By nvidia and 1 other • Jan 23 • 64
Paper: Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters • 2408.03314 • Published Aug 6, 2024 • 57
Collection: Qwen2.5-Coder, a code-specific model series based on Qwen2.5 • 40 items • Updated Nov 28, 2024 • 292
Paper: RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations • 2402.17700 • Published Feb 27, 2024 • 2