edwko committed on
Commit 663f724 · verified · 1 Parent(s): 620431e

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -129,7 +129,7 @@ In this process, the CNT solar cells generate a tiny amount of power, but when t
  ## Usage with HuggingFace transformers
  Model weights were converted from the original Mamba2 implementation to be Hugging Face compatible. <br>
  Due to the lack of official support for Mamba2 attention layers in Hugging Face Transformers, custom modeling files are included. <br>
- The attention layer implementation for the modeling files are based on the work from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
 
+ The implementation of Mamba2 with attention in the modeling files comes from Pull Request #32027 in the Hugging Face Transformers repository: [https://github.com/huggingface/transformers/pull/32027](https://github.com/huggingface/transformers/pull/32027)
 
  To speed up inference, we recommend installing mamba-ssm and flash attention 2.
 
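
Note on the README text in this hunk: it recommends installing mamba-ssm and flash attention 2 but does not show how to tell whether they are present. The following is a minimal sketch, not taken from the repository, that checks whether the two optional packages can be imported and picks an attention backend accordingly. The helper names `optional_speedups` and `pick_attn_implementation` are hypothetical, the module names `mamba_ssm` and `flash_attn` are assumed to be what the `mamba-ssm` and `flash-attn` packages install, and whether the custom modeling files accept `"flash_attention_2"` or `"eager"` is also an assumption.

```python
import importlib.util


def optional_speedups():
    """Report which optional acceleration packages are importable.

    Assumption: the `mamba-ssm` package installs the module `mamba_ssm`
    and the `flash-attn` package installs the module `flash_attn`.
    """
    modules = {"mamba-ssm": "mamba_ssm", "flash-attn": "flash_attn"}
    return {
        package: importlib.util.find_spec(module) is not None
        for package, module in modules.items()
    }


def pick_attn_implementation(available):
    """Choose an attention backend name from the availability report.

    Assumption: the custom modeling files honor these two values;
    "eager" is used as the fallback when flash-attn is missing.
    """
    return "flash_attention_2" if available.get("flash-attn") else "eager"


if __name__ == "__main__":
    available = optional_speedups()
    print(available)
    print(pick_attn_implementation(available))
```

Because the modeling files are custom, loading the model would presumably go through `AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True, attn_implementation=...)`, where `repo_id` is a placeholder for this repository's identifier, which is not stated in the diff.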