Update README.md (#8)
- Update README.md (ceefe6791c7907f137f4119367b654ba936f74e9)
README.md
CHANGED
@@ -198,8 +198,9 @@ Data used for model training and how the data was processed.
 ### Training Dataset
 
 These models were trained on a dataset of text data that includes a wide variety
-of sources. The 27B model was trained with 13 trillion tokens
-
+of sources. The 27B model was trained with 13 trillion tokens, the 9B model was
+trained with 8 trillion tokens, and the 2B model was trained with 2 trillion tokens.
+Here are the key components:
 
 * Web Documents: A diverse collection of web text ensures the model is exposed
   to a broad range of linguistic styles, topics, and vocabulary. Primarily
@@ -382,7 +383,7 @@ and in brief in the
 <tr>
 <th>Evaluation</th>
 <th>Capability</th>
-<th>Gemma 2 27B</th>
+<th>Gemma 2 IT 27B</th>
 </tr>
 </thead>
 <tbody>