RuntimeError: FlashAttention only supports Ampere GPUs or newer.

#11
by pravinkarpe - opened

Got the below error when running the model in the colab notebook.
RuntimeError: FlashAttention only supports Ampere GPUs or newer.

Microsoft org
nguyenbh changed discussion status to closed

Sign up or log in to comment