Explicit Expansion in MHAEinsum is Redundant #772
-
PyTorch's The Ch03_MHA approach is more efficient because:
The MHAEinsum expansion step can be simplified to match Ch03_MHA's approach without any functional difference. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Thanks for the suggestion! I agree, this is more efficient, and definitely more elegant. I don't know why I had issues with it before. I'm updating it via #775 |
Beta Was this translation helpful? Give feedback.
Thanks for the suggestion! I agree, this is more efficient, and definitely more elegant. I don't know why I had issues with it before. I'm updating it via #775