qanthony-z
commited on
Commit
•
f8e1f70
1
Parent(s):
aedddb4
add mt bench fig
Browse files
README.md
CHANGED
@@ -67,7 +67,7 @@ Zamba2-1.2B-Instruct achieves leading instruction-following and multi-turn chat
|
|
67 |
Moreover, due to its unique hybrid SSM architecture, Zamba2-1.2B-Instruct achieves extremely low inference latency and rapid generation with a significantly smaller memory footprint than comparable transformer-based models.
|
68 |
|
69 |
<center>
|
70 |
-
<img src="https://cdn-uploads.huggingface.co/production/uploads/
|
71 |
</center>
|
72 |
|
73 |
|
|
|
67 |
Moreover, due to its unique hybrid SSM architecture, Zamba2-1.2B-Instruct achieves extremely low inference latency and rapid generation with a significantly smaller memory footprint than comparable transformer-based models.
|
68 |
|
69 |
<center>
|
70 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/65bc13717c6ad1994b6619e9/Q82BVdIppSyqPBHYEAjAl.png" width="700" alt="Zamba performance">
|
71 |
</center>
|
72 |
|
73 |
|