Update README.md
README.md
CHANGED
@@ -64,7 +64,7 @@ print(tokenizer.batch_decode(response, skip_special_tokens=True)[0])
 
 ## Training
 
-We trained our Bamba model with FSDP using our training repo [here](https://github.com/foundation-model-stack/fms-fsdp
+We trained our Bamba model with FSDP using our training repo [here](https://github.com/foundation-model-stack/fms-fsdp).
 Note that this training effort was started before FSDP2 and also long before we contributed
 `Mamba2-Hybrid` to HF, so we were doing FSDP1 training with [official Mamba implementation](https://github.com/state-spaces/mamba).
 For users trying to reproduce the training you now have much more options with our newly