Pratyay Banerjee's picture

In a Training Loop 🔄

Pratyay Banerjee

Neilblaze

·

https://neilblaze.live

AI & ML interests

IR, NLP, Pattern Recognition, xAI, Interpretability, Evals

Recent Activity

liked a model about 4 hours ago

google/diffusiongemma-26B-A4B-it

upvoted a paper about 15 hours ago

OpenSkill: Open-World Self-Evolution for LLM Agents

upvoted a paper about 15 hours ago

Rethinking the Divergence Regularization in LLM RL

View all activity

Organizations

liked a model about 4 hours ago

google/diffusiongemma-26B-A4B-it

Image-Text-to-Text • 26B • Updated 1 day ago • 504

upvoted 7 papers about 15 hours ago

OpenSkill: Open-World Self-Evolution for LLM Agents

Paper • 2606.06741 • Published 8 days ago • 26

Rethinking the Divergence Regularization in LLM RL

Paper • 2606.09821 • Published 4 days ago • 28

SearchSwarm: Towards Delegation Intelligence in Agentic LLMs for Long-Horizon Deep Research

Paper • 2606.09730 • Published 4 days ago • 49

Retrospective Harness Optimization: Improving LLM Agents via Self-Preference over Trajectory Rollouts

Paper • 2606.05922 • Published 8 days ago • 52

LatentSkill: From In-Context Textual Skills to In-Weight Latent Skills for LLM Agents

Paper • 2606.06087 • Published 8 days ago • 59

KVarN: Variance-Normalized KV-Cache Quantization Mitigates Error Accumulation in Reasoning Tasks

Paper • 2606.03458 • Published 10 days ago • 60

Agents' Last Exam

Paper • 2606.05405 • Published 9 days ago • 295

upvoted an article 1 day ago

Article

Introducing North Mini Code: Cohere’s First Model For Developers

CohereLabs

•

2 days ago

• 57

liked a model 1 day ago

CohereLabs/North-Mini-Code-1.0-fp8

Text Generation • 31B • Updated 2 days ago • 29 • 16

liked a dataset 1 day ago

nanotron/ultrascale-playbook-data

Updated Mar 12, 2025 • 668 • 8

liked a model 1 day ago

HuggingFaceTB/SmolLM2-135M

Text Generation • 0.1B • Updated Feb 6, 2025 • 1.39M • 199

liked a Space 2 days ago

Open SLM Leaderboard

Open Small Language Model Leaderboard

liked a model 2 days ago

XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash

Text Generation • 554B • Updated 4 days ago • 660 • 87

upvoted a collection 3 days ago

Deepseek Papers

Deepseek papers collection • 31 items • Updated 4 days ago • 350

liked 3 models 4 days ago

byteshape/Qwen3.6-35B-A3B-MTP-GGUF

Image-Text-to-Text • 36B • Updated 23 days ago • 36.1k • 60

nvidia/nemotron-3.5-asr-streaming-0.6b

Automatic Speech Recognition • Updated 6 days ago • 4.97k • • 372

mlx-community/gemma-4-12B-it-8bit

Image-Text-to-Text • 3B • Updated 4 days ago • 41.8k • 31

upvoted a collection 5 days ago

Laguna XS.2

Designed for agentic coding and long-horizon work on a local machine. Apache 2.0. • 5 items • Updated May 7 • 25

upvoted a collection 6 days ago

Gemma 4 QAT

Gemma 4 QAT (Quantization-Aware Training) for 3x less memory use and near original accuracy. • 16 items • Updated 6 days ago • 81