社区博客与文章

Community Articles

基于 Transformers 的编码器-解码器模型

Reformer 模型 - 突破语言建模的极限

如何生成文本：通过 Transformers 用不同的解码方法生成文本

NEW Articles from Team or Enterprise organizations will get promoted to the main section.

社区博客与文章

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages

EMO: Pretraining mixture of experts for emergent modularity

Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law

KV Caching Explained: Optimizing Transformer Inference Efficiency

Uncensor any LLM with abliteration

How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance

Software Forgets: Agent Traces Are the Memory

Code a simple RAG from scratch

Small Language Models (SLM): A Comprehensive Overview

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

NEO-unify: Building Native Multimodal Unified Models End to End

Mastering Tensor Dimensions in Transformers

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Norm-Preserving Biprojected Abliteration

Forge: Scalable Agent RL Framework and Algorithm

LLM Architectures Explained: What Powers Today’s Top Models

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

Talking to a 4-Year-Old: A Multilingual Benchmark for Children's AI Companions

SSE Retrieval MRL v2: Regularization of Representation Space and Performance Improvement via Hyperparameter Optimization

基于 Transformers 的编码器-解码器模型

Reformer 模型 - 突破语言建模的极限

如何生成文本：通过 Transformers 用不同的解码方法生成文本

Training-Free Reasoning at 88.89% on GPQA Diamond: How Darwin Family Hit Frontier Scores Without a Single Gradient Step

Vividh-ASR: Diagnosing and Fixing Studio-Bias in Whisper for Indic Languages

EMO: Pretraining mixture of experts for emergent modularity

Two Years of Local AI on a Laptop: When Open Models Outpaced Moore's Law

KV Caching Explained: Optimizing Transformer Inference Efficiency

Uncensor any LLM with abliteration

How to Comply with SOC 2 and ISO 27001 with Hugging Face: A Practical Guide to AI Model Supply Chain Governance

Software Forgets: Agent Traces Are the Memory

Code a simple RAG from scratch

Small Language Models (SLM): A Comprehensive Overview

A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond

NEO-unify: Building Native Multimodal Unified Models End to End

Mastering Tensor Dimensions in Transformers

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

Norm-Preserving Biprojected Abliteration

Forge: Scalable Agent RL Framework and Algorithm

LLM Architectures Explained: What Powers Today’s Top Models

NVIDIA Isaac GR00T N1.7: Open Reasoning VLA Model for Humanoid Robots

Talking to a 4-Year-Old: A Multilingual Benchmark for Children's AI Companions

SSE Retrieval MRL v2: Regularization of Representation Space and Performance Improvement via Hyperparameter Optimization