We are thrilled to announce Jamba, the world's first production-grade Mamba-based model.
Key Features:

- First production-grade Mamba-based model built on a novel SSM-Transformer hybrid architecture
- 3X throughput on long contexts compared to Mixtral 8x7B
- Democratizes access to a massive 256K context window
- The only model in its size class that fits up to 140K context on a single GPU
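To make the single-GPU claim concrete, here is a minimal sketch of loading the model with Hugging Face `transformers`. The model ID `ai21labs/Jamba-v0.1` and the bf16/device-map settings are assumptions for illustration, not official usage instructions.

```python
# A minimal sketch of loading and querying the model via Hugging Face
# transformers. The model ID below is an assumption, not taken from this
# announcement; check the model card for the canonical instructions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ai21labs/Jamba-v0.1"  # assumed Hugging Face model ID

tokenizer = AutoTokenizer.from_pretrained(model_id)
# Half precision helps a long context fit on a single large GPU.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

prompt = "The key advantage of a hybrid SSM-Transformer model is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```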
Jamba is based on a novel architecture that combines the Mamba structured state-space model (SSM) with the Transformer. While our initial results show great efficiency gains, we expect these to be further explored and improved with the help of the community.
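To illustrate the hybrid idea, the sketch below interleaves SSM-style blocks with occasional attention blocks. This is a conceptual toy, not Jamba's actual implementation: the layer ratio, dimensions, and the gated causal convolution standing in for the selective SSM scan are all placeholders. The intuition it captures is that most layers run in time linear in sequence length, while a few attention layers retain the Transformer's global mixing.

```python
# Conceptual sketch of a hybrid SSM-Transformer stack. Not Jamba's actual
# implementation; all sizes and the attention-to-SSM ratio are placeholders.
import torch
import torch.nn as nn

class AttentionBlock(nn.Module):
    """Standard self-attention block standing in for a Transformer layer."""
    def __init__(self, d_model: int, n_heads: int = 8):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm(x)
        out, _ = self.attn(h, h, h, need_weights=False)
        return x + out  # residual connection

class SSMBlock(nn.Module):
    """Placeholder for a Mamba-style SSM layer: a gated causal depthwise
    convolution stands in for the selective state-space scan, which runs
    in time linear in sequence length."""
    def __init__(self, d_model: int, kernel_size: int = 4):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        self.conv = nn.Conv1d(d_model, d_model, kernel_size,
                              padding=kernel_size - 1, groups=d_model)
        self.gate = nn.Linear(d_model, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm(x)
        # causal convolution over the sequence dimension
        c = self.conv(h.transpose(1, 2))[..., : x.shape[1]].transpose(1, 2)
        return x + c * torch.sigmoid(self.gate(h))  # gated residual

class HybridStack(nn.Module):
    """Interleave SSM blocks with an occasional attention block."""
    def __init__(self, d_model: int, n_layers: int, attn_every: int = 4):
        super().__init__()
        self.layers = nn.ModuleList(
            AttentionBlock(d_model) if (i + 1) % attn_every == 0
            else SSMBlock(d_model)
            for i in range(n_layers)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for layer in self.layers:
            x = layer(x)
        return x

x = torch.randn(2, 128, 512)          # (batch, seq_len, d_model)
print(HybridStack(512, 8)(x).shape)   # torch.Size([2, 128, 512])
```

Keeping attention layers sparse in a stack like this is what lets the memory and compute cost of most layers stay flat as the context grows, which is the design motivation behind the throughput and context-length numbers above.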