sdpa supported?

#7
by penut85420 - opened

I found that Phi3SdpaAttention has been implemented, but the attribute _supports_sdpa is set to false. Why?

Microsoft org

The model is optimized for flash attention, and we have not fully tested SDPA yet. We would love to know from your experience. Thank you again for your interest!

nguyenbh changed discussion status to closed

Sign up or log in to comment