sdpa supported?
#7
by
penut85420
- opened
I found that Phi3SdpaAttention
has been implemented, but the attribute _supports_sdpa
is set to false. Why?
The model is optimized for flash attention, and we have not fully tested SDPA yet. We would love to know from your experience. Thank you again for your interest!
nguyenbh
changed discussion status to
closed