Any plan for W4A4 INT4 quantization? #11705
AI-bot-easy
started this conversation in
General
Replies: 0 comments
I see there is some 4-bit W4A4 FP4 quantization support, but as far as I can tell it only works on NVIDIA Blackwell GPUs for now. Is that right?
Is there anything for INT4 W4A4: existing support, a plan to add it, or a decision not to support it?