@DarkLight1337 DarkLight1337 commented Aug 27, 2025

Purpose

Simplify the code, since SupportsMultiModalWithRawInput only adds a single flag on top of SupportsMultiModal. Also rename downstream attributes to be more consistent with those of transcription-only models (e.g. supports_transcription_only).
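A rough sketch of the idea, not the actual vLLM code: a separate protocol that only contributes one flag can be folded into the base protocol as a class-level boolean with a default of False. Only the names SupportsMultiModal and supports_multimodal_raw_input_only come from this PR; the two model classes are hypothetical, and the real interface has more members than shown here.

```python
from typing import ClassVar, Protocol


class SupportsMultiModal(Protocol):
    """Simplified stand-in for the multi-modal model interface."""

    # Flag that previously lived on a separate protocol. Models that only
    # accept raw multi-modal input override it with True.
    supports_multimodal_raw_input_only: ClassVar[bool] = False


class RawInputModel(SupportsMultiModal):
    """Hypothetical model that opts in to raw-input-only handling."""

    supports_multimodal_raw_input_only = True


class RegularModel(SupportsMultiModal):
    """Hypothetical model that keeps the default (False)."""
```

With this shape a model opts in by overriding one attribute instead of implementing a second protocol.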

cc @mgazz @christian-pinto

Test Plan

Test Result


Essential Elements of an Effective PR Description Checklist
  • The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
  • The test plan, such as providing test command.
  • The test results, such as pasting the results comparison before and after, or e2e results
  • (Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
  • (Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

@DarkLight1337 DarkLight1337 requested a review from Isotr0py August 27, 2025 13:47
@DarkLight1337 DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 27, 2025
@mergify mergify bot added new-model Requests to new models v1 labels Aug 27, 2025
@DarkLight1337 DarkLight1337 changed the title [Model] Merge SupportsMultiModalWithRawInput with SupportsMultiModal [Model] Merge SupportsMultiModalWithRawInput with SupportsMultiModal Aug 27, 2025
@DarkLight1337 DarkLight1337 added multi-modality Related to multi-modality (#4194) and removed new-model Requests to new models labels Aug 27, 2025
@mergify mergify bot added the new-model Requests to new models label Aug 27, 2025
@DarkLight1337 DarkLight1337 enabled auto-merge (squash) August 27, 2025 13:50
@rahul-tuli rahul-tuli left a comment

Nice!

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request refactors the multi-modal model interface by merging the SupportsMultiModalWithRawInput protocol into SupportsMultiModal. This simplification is achieved by introducing a new boolean flag, supports_multimodal_raw_input_only, which is a cleaner design. Downstream attributes and methods have been renamed for better clarity and consistency, such as is_multimodal_raw_input_only_model. The changes are applied consistently across all affected files, improving code maintainability. The refactoring is well-executed and I have no concerns.
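As an illustration of how downstream code might consume such a flag, here is a minimal sketch under stated assumptions. The attribute names supports_transcription_only and supports_multimodal_raw_input_only appear in this PR; the helper function, the ModelInfoSketch class, and the example model are hypothetical and do not mirror the real vLLM signatures.

```python
from dataclasses import dataclass


def supports_raw_input_only(model_cls: type) -> bool:
    """Hypothetical helper: read the class-level flag, defaulting to False."""
    return bool(getattr(model_cls, "supports_multimodal_raw_input_only", False))


@dataclass(frozen=True)
class ModelInfoSketch:
    """Hypothetical summary of the capability flags of a model class."""

    supports_transcription_only: bool
    supports_multimodal_raw_input_only: bool

    @classmethod
    def from_model_cls(cls, model_cls: type) -> "ModelInfoSketch":
        # Both flags are looked up the same way, which is the kind of
        # consistency the renaming is meant to provide.
        return cls(
            supports_transcription_only=bool(
                getattr(model_cls, "supports_transcription_only", False)
            ),
            supports_multimodal_raw_input_only=supports_raw_input_only(model_cls),
        )


class ExampleRawInputModel:
    """Hypothetical model class that sets only the raw-input flag."""

    supports_multimodal_raw_input_only = True


info = ModelInfoSketch.from_model_cls(ExampleRawInputModel)
```

Reading every capability through a flag with a False default means models that do not declare it need no changes.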

@mergify mergify bot added the new-model Requests to new models label Aug 27, 2025
@DarkLight1337
Prithvi tests pass locally so this should be fine to merge

@vllm-bot vllm-bot merged commit 52883ed into vllm-project:main Aug 27, 2025
42 of 45 checks passed
@DarkLight1337 DarkLight1337 deleted the mm-interface branch August 27, 2025 17:01
epwalsh pushed a commit to epwalsh/vllm that referenced this pull request Aug 28, 2025
xiao-llm pushed a commit to xiao-llm/vllm that referenced this pull request Aug 28, 2025
zhewenl pushed a commit to zhewenl/vllm that referenced this pull request Aug 28, 2025
dumb0002 pushed a commit to dumb0002/vllm that referenced this pull request Aug 28, 2025
2015aroras pushed a commit to 2015aroras/vllm that referenced this pull request Aug 29, 2025
Labels
multi-modality Related to multi-modality (#4194) new-model Requests to new models ready ONLY add when PR is ready to merge/full CI is needed v1
4 participants