Merge branch 'main' of https://huggingface.co/rinna/nekomata-14b into main
README.md CHANGED

@@ -44,6 +44,10 @@ The name `nekomata` comes from the Japanese word [`猫又/ねこまた/Nekomata`
 - [Wikipedia](https://dumps.wikimedia.org/other/cirrussearch)
 - rinna curated Japanese dataset
 
+* **Training Infrastructure**
+
+    `nekomata-14B` was trained on 16 nodes of Amazon EC2 trn1.32xlarge instances, powered by AWS Trainium, a purpose-built ML accelerator chip. The pre-training job was completed in approximately 7 days.
+
 * **Authors**
 
 - [Tianyu Zhao](https://huggingface.co/tianyuz)

@@ -117,7 +121,7 @@ We compared the `Qwen` tokenizer (as used in `nekomata`) and the `llama-2` token
 @misc{RinnaNekomata14b,
     url={https://huggingface.co/rinna/nekomata-14b},
     title={rinna/nekomata-14b},
-    author={Zhao, Tianyu and Kaga, Akio and
+    author={Zhao, Tianyu and Kaga, Akio and Sawada, Kei}
 }
 ~~~
 ---
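The second hunk's context line references the model card's comparison of the `Qwen` tokenizer (as used in `nekomata`) against the `llama-2` tokenizer. A minimal sketch of how such a token-count comparison can be reproduced with `transformers`; the sample sentence is an illustrative assumption, and the official `meta-llama/Llama-2-7b-hf` repo is gated, so substitute any accessible Llama-2 tokenizer:

```python
from transformers import AutoTokenizer

# Illustrative Japanese sentence (not taken from the model card).
text = "猫又は日本の妖怪の一種である。"

# Qwen tokenizer shipped with nekomata; the checkpoint requires trust_remote_code.
qwen_tok = AutoTokenizer.from_pretrained("rinna/nekomata-14b", trust_remote_code=True)

# llama-2 tokenizer (gated repo; any accessible Llama-2 tokenizer also works).
llama_tok = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")

# Fewer tokens for the same text indicates better compression of Japanese.
print("Qwen/nekomata tokens:", len(qwen_tok.encode(text)))
print("llama-2 tokens:", len(llama_tok.encode(text)))
```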