sergiopaniego
posted an update 11 days ago
gpt-oss was possible thanks to new engineering efforts in 🤗 transformers. We just dropped a blog covering them:

- Kernels from the Hub
- MXFP4 Quantization
- Tensor & Expert Parallelism
- Dynamic Sliding Window & Cache
- Continuous Batching & Paged Attention

Grab a coffee & dive in! ☕️

https://huggingface.co/blog/faster-transformers