Update README.md
Browse files
README.md
CHANGED
@@ -68,7 +68,7 @@ We trained our Bamba model with FSDP using our training repo [here](https://gith
|
|
68 |
Note that this training effort was started before FSDP2 and also long before we contributed
|
69 |
`Mamba2-Hybrid` to HF, so we were doing FSDP1 training with [official Mamba implementation](https://github.com/state-spaces/mamba).
|
70 |
For users trying to reproduce the training you now have much more options with our newly
|
71 |
-
contributed [HF-version of Mamba2-Hybrid]()
|
72 |
|
73 |
|
74 |
## Benchmark scores
|
|
|
68 |
Note that this training effort was started before FSDP2 and also long before we contributed
|
69 |
`Mamba2-Hybrid` to HF, so we were doing FSDP1 training with [official Mamba implementation](https://github.com/state-spaces/mamba).
|
70 |
For users trying to reproduce the training you now have much more options with our newly
|
71 |
+
contributed [HF-version of Mamba2-Hybrid](https://github.com/huggingface/transformers/tree/main/src/transformers/models/bamba).
|
72 |
|
73 |
|
74 |
## Benchmark scores
|