Can we use torch for the attention implementation?
#8 opened by LouiSum
Currently the LM is using Triton for the attention implementation. Can we change it to torch in the config?
Yes, the model supports `torch` or `triton` for the `attn_impl` kwarg, and `'torch'` is the default. So just don't pass the `attn_impl` kwarg to the `AutoModelForCausalLM.from_pretrained` call and it will default to the torch attention implementation!
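For reference, a minimal sketch of what that looks like. The checkpoint name below is a placeholder (the thread doesn't name the exact model), and `trust_remote_code=True` is assumed because models with custom attention code typically require it:

```python
from transformers import AutoModelForCausalLM

MODEL_ID = "replit/replit-code-v1-3b"  # placeholder; substitute your checkpoint

# Omitting the attn_impl kwarg leaves the model on the default torch
# attention path, so no Triton installation is needed.
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    trust_remote_code=True,  # assumed: needed for custom modeling code
)

# To opt into the Triton kernel explicitly instead (requires triton installed):
# model = AutoModelForCausalLM.from_pretrained(
#     MODEL_ID, attn_impl="triton", trust_remote_code=True
# )
```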
Lmk if you continue to have trouble with this!
It works. Thanks
madhavatreplit changed discussion status to closed