-
-
Notifications
You must be signed in to change notification settings - Fork 9.8k
[Model] Merge SupportsMultiModalWithRawInput
with SupportsMultiModal
#23749
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: DarkLight1337 <[email protected]>
SupportsMultiModalWithRawInput
with SupportsMultiModal
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request refactors the multi-modal model interface by merging the SupportsMultiModalWithRawInput
protocol into SupportsMultiModal
. This simplification is achieved by introducing a new boolean flag, supports_multimodal_raw_input_only
, which is a cleaner design. Downstream attributes and methods have been renamed for better clarity and consistency, such as is_multimodal_raw_input_only_model
. The changes are applied consistently across all affected files, improving code maintainability. The refactoring is well-executed and I have no concerns.
Prithvi tests pass locally so this should be fine to merge |
…al` (vllm-project#23749) Signed-off-by: DarkLight1337 <[email protected]>
…al` (vllm-project#23749) Signed-off-by: DarkLight1337 <[email protected]>
…al` (vllm-project#23749) Signed-off-by: DarkLight1337 <[email protected]>
…al` (vllm-project#23749) Signed-off-by: DarkLight1337 <[email protected]>
…al` (vllm-project#23749) Signed-off-by: DarkLight1337 <[email protected]>
Purpose
Simplify the code since
SupportsMultiModalWithRawInput
just adds a single flag. Also rename downstream attributes to be more consistent with that of transcription-only models (e.g.supports_transcription_only
).cc @mgazz @christian-pinto
Test Plan
Test Result
Essential Elements of an Effective PR Description Checklist
supported_models.md
andexamples
for a new model.