-
Notifications
You must be signed in to change notification settings - Fork 13.6k
CUDA: fix bug in rms_norm fusion #15660
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
@am17an Try just |
ggml-ci
5121760 to
28fc7cc
Compare
|
For |
Ahhh. |
|
The code that triggers the CI is at https://github.com/ggml-org/ci/blob/master/run.sh. It shouldn't be too hard to modify it to monitor all the branches in pull requests using the github API. |
|
Since we can't test the ggml-ci right away, I will proceed to merge this since it fixes some bugs in master. |
* CUDA: fix bug in rms_norm fusion * Fix bug for OP_REPEAT * Fix index for add
Fix bugs pointed out by @ORippler and @CISC in #15631
dst->src[I+1], whereas in repeat we only have 1 src