Zyphra
/

Zamba2-1.2B

transformers_zamba2

Model card Files Files and versions Community

BerenMillidge commited on Aug 24

Commit

e0a875f

•

1 Parent(s): 85a3ce0

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -66,7 +66,7 @@ Zamba2-1.2B utilizes and extends our original Zamba hybrid SSM-attention archite
 ## Performance
-Zamba2-1.2B achieves leading and state-of-the-art performance among models of <3B parameters and is competitive with some models of significantly greater size. Moreover, due to its unique hybrid SSM architecture, Zamba2-1.2B achieves extremely low inference latency and rapid generation with a significantly smaller memory footprint than comparable transformer based models.
 Zamba2-1.2B's high performance and small inference compute and memory footprint renders it an ideal generalist model for on-device applications.

 ## Performance
+Zamba2-1.2B achieves leading and state-of-the-art performance among models of <2B parameters and is competitive with some models of significantly greater size. Moreover, due to its unique hybrid SSM architecture, Zamba2-1.2B achieves extremely low inference latency and rapid generation with a significantly smaller memory footprint than comparable transformer based models.
 Zamba2-1.2B's high performance and small inference compute and memory footprint renders it an ideal generalist model for on-device applications.