BerenMillidge
commited on
Commit
•
e0a875f
1
Parent(s):
85a3ce0
Update README.md
Browse files
README.md
CHANGED
@@ -66,7 +66,7 @@ Zamba2-1.2B utilizes and extends our original Zamba hybrid SSM-attention archite
|
|
66 |
|
67 |
## Performance
|
68 |
|
69 |
-
Zamba2-1.2B achieves leading and state-of-the-art performance among models of <
|
70 |
|
71 |
Zamba2-1.2B's high performance and small inference compute and memory footprint renders it an ideal generalist model for on-device applications.
|
72 |
|
|
|
66 |
|
67 |
## Performance
|
68 |
|
69 |
+
Zamba2-1.2B achieves leading and state-of-the-art performance among models of <2B parameters and is competitive with some models of significantly greater size. Moreover, due to its unique hybrid SSM architecture, Zamba2-1.2B achieves extremely low inference latency and rapid generation with a significantly smaller memory footprint than comparable transformer based models.
|
70 |
|
71 |
Zamba2-1.2B's high performance and small inference compute and memory footprint renders it an ideal generalist model for on-device applications.
|
72 |
|