wxgeorge committed
Commit cd922ef
1 Parent(s): 9d7c1ad

correct link for featherless inference + one typo

Files changed (1): README.md +3 -3
README.md CHANGED
@@ -37,7 +37,7 @@ This approach demonstrates the architecture design and scalability of RWKV, rein
 
 One downside to this technique is that the model's inherent knowledge and dataset training are inherited from its "parent" model. Consequently, unlike previous RWKV models trained on over 100+ languages, the QRWKV model is limited to approximately 30 languages supported by the Qwen line of models.
 
-Due to the the lack of RWKV-based channel mix and feedforward layers, seperate inference code is needed for this specific model.
+Due to the the lack of RWKV-based channel mix and feedforward layers, separate inference code is needed for this specific model.
 
 Furthermore, due to compute constraints, we were only able to train up to 16K token context length. While the model is stable beyond this limit, additional training might be required to support longer context lengths.
 
@@ -53,8 +53,8 @@ Lastly, we intend to provide details on the conversion along with our paper afte
 ## Links
 - [Our wiki](https://wiki.rwkv.com)
 - [TensorWave - The AMD Cloud](https://tensorwave.com) - Access MI300X today!
-- [Recursal.AI Cloud Platform](https://recursal.ai)
-- [Featherless Inference](https://featherless.ai/models/RWKV/)
+- [Recursal.AI Cloud Platform](https://platform.recursal.ai)
+- [Featherless Inference](https://featherless.ai/model-families/rwkv6/)
 
 ## Acknowledgement
 We are grateful for the help and support from the following key groups: