Shyam Sunder Kumar's picture

Open to Collab

Shyam Sunder Kumar

theainerd

·

AI & ML interests

Natural Language Processing

Recent Activity

upvoted a collection 1 day ago

Health AI Developer Foundations (HAI-DEF)

reacted to AdinaY's post with 🚀 1 day ago

AgentCPM-Explore🔥 on device agent foundation model released by OpenBMB https://huggingface.co/openbmb/AgentCPM-Explore ✨ 4B - Apache2.0 ✨ Supports 100+ multi-turn environment interactions with search + verification ✨ Full training/inference stack is openly shared as well

liked a model 7 days ago

nvidia/nemotron-speech-streaming-en-0.6b

View all activity

Organizations

upvoted a collection 1 day ago

Health AI Developer Foundations (HAI-DEF)

Groups models released for use in health AI by Google. Read more about HAI-DEF at http://goo.gle/hai-def • 22 items • Updated 2 days ago • 151

upvoted 4 collections about 1 month ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Dec 4, 2025 • 186

Bhojpuri and Hindi Rural Women ASR

This dataset includes ASR data from rural women speaking Hindi and Bhojpuri, supporting inclusive voice recognition. • 2 items • Updated Nov 6, 2025 • 1

Inference Optimized Checkpoints (with Model Optimizer)

A collection of generative models quantized and optimized for inference with Model Optimizer. • 46 items • Updated about 22 hours ago • 70

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated Dec 2, 2025 • 83

upvoted an article about 2 months ago

Article

Continuous batching from first principles

+1

Nov 25, 2025

•

301

upvoted a paper 2 months ago

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 132

upvoted 2 collections 2 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated Nov 14, 2025 • 162

🎆 October 2025 - China Open Source Highlights

29 items • Updated 5 days ago • 13

upvoted 2 papers 2 months ago

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29, 2025 • 77

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24, 2025 • 60

upvoted a collection 2 months ago

Nemotron RAG

14 items • Updated about 22 hours ago • 59

upvoted a collection 3 months ago

gpt-oss-safeguard

gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated Oct 29, 2025 • 59

upvoted a paper 3 months ago

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 67

upvoted an article 3 months ago

Article

Building the Open Agent Ecosystem Together: Introducing OpenEnv

+8

Oct 23, 2025

•

143

upvoted a collection 3 months ago

🎯 Liquid Nanos

Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices • 26 items • Updated 1 day ago • 105

upvoted an article 3 months ago

Article

Smol2Operator: Post-Training GUI Agents for Computer Use

+3

Sep 23, 2025

•

135

upvoted a collection 3 months ago

GLM-4.6

7 items • Updated Nov 5, 2025 • 51

upvoted a collection 4 months ago

DeepSeek-V3.2

4 items • Updated Dec 1, 2025 • 515

upvoted an article 4 months ago

Article

Gaia2 and ARE: Empowering the community to study agents

+9

Sep 22, 2025

•

125