lunahr
/

SystemGemma2-9b-it

@@ -274,8 +274,8 @@ Data used for model training and how the data was processed.
 ### Training Dataset
-These models were trained on a dataset of text data that includes a wide variety
-of sources, totaling 15 trillion tokens. Here are the key components:
 * Web Documents: A diverse collection of web text ensures the model is exposed
   to a broad range of linguistic styles, topics, and vocabulary. Primarily

 ### Training Dataset
+These models were trained on a dataset of text data that includes a wide variety of sources. The 27B model was trained with 13 trillion tokens and the 9B model was trained with 8 trillion tokens.
+Here are the key components:
 * Web Documents: A diverse collection of web text ensures the model is exposed
   to a broad range of linguistic styles, topics, and vocabulary. Primarily