
Tatako Felici

tatakof

AI & ML interests

Transformers, Reinforcement Learning, Differentiable Programming

Recent Activity

updated a model 14 days ago
sandbox-ai/Llama-3.1-Tango-70b-GGUF
updated a model 24 days ago
sandbox-ai/Llama-3.1-Tango-70b
liked a model 28 days ago
sandbox-ai/Llama-3.1-Tango-70b

Organizations

tatakof's activity

Reacted to ordagan's post with 🔥 8 months ago
Excited to introduce Jamba by AI21
ai21labs/Jamba-v0.1

We are thrilled to announce Jamba, the world's first production-grade Mamba-based model.

Key Features:
- First production-grade Mamba-based model built on a novel SSM-Transformer hybrid architecture
- 3X throughput on long contexts compared to Mixtral 8x7B
- Democratizes access to a massive 256K context window
- The only model in its size class that fits up to 140K context on a single GPU

Jamba is based on a novel architecture that combines Mamba and Transformer. While our initial results show great efficiency gains, we expect this to be further explored and improved with the help of the community.

Check out our blog post for more info: https://ai21-labs.webflow.io/blog/announcing-jamba
liked a Space 12 months ago