Community Blog & Articles

NEW You can also read the Chinese version of this blog

Community Articles

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

ML Intern Takes Our Post-Training Internship Test

How to Ground a Korean AI Agent in Real Demographics with Synthetic Personas

Building a Fast Multilingual OCR Model with Synthetic Data

KV Caching Explained: Optimizing Transformer Inference Efficiency

Multilingual Tool Calling in 70+ Languages, On Device

Save the traces! 🐳

mlinter: a linter for Transformers modeling files

Hy3 preview: A Rebuilt Hunyuan, a 21B-Active MoE, and a New Reasoning Recipe

Darwin-TTS: We Gave a TTS Model 3% of an LLM's Brain — It Started Showing Emotion

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

Code a simple RAG from scratch

Mastering Tensor Dimensions in Transformers

Introducing Cohere-transcribe: state-of-the-art speech recognition

How I contributed a new model to the Transformers library using Codex

How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs

Introducing the Bright Data CLI for Automated Web Data Pipelines

RL: A Structured Human Action & Intent Dataset for Physical AI and World Models

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Measuring What Matters: Objective Metrics for Image Generation Assessment

Gemma 4 VLA Demo on Jetson Orin Nano Super

llm, moe, long-context

DeepSeek-V4: a million-token context that agents can actually use

guide, transformers.js, javascript

How to Use Transformers.js in a Chrome Extension

QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard

cybersecurity, open-source, community

AI and the Future of Cybersecurity: Why Openness Matters

reinforcement-learning, rlvr, e-commerce

Ecom-RLVE: Adaptive Verifiable Environments for E-Commerce Conversational Agents

announcement, mlx, llm

The PR you would have opened yourself

multimodal, nlp, community

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

Meet HoloTab by HCompany. Your AI browser companion.

announcement, diffusion, world-model

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

multimodal, nlp, community

Multimodal Embedding & Reranker Models with Sentence Transformers

open-source-collab, partnerships, open-source

Safetensors is Joining the PyTorch Foundation

multimodal, on-device, gemma4

Welcome Gemma 4: Frontier multimodal intelligence on device

Community Articles

NEW Articles from Team or Enterprise organizations will get promoted to the main section.
