Pull requests: vllm-project/vllm

Pull requests list

[Config] Remove Unused Environment Variable VLLM_DISABLE_PAD_FOR_CUDAGRAPH
Labels: ready, v1
#26743 opened Oct 13, 2025 by yewentao256
[Easy] Fix env type check errors from VLLM_DEBUG_LOG_API_SERVER_RESPONSE
#26742 opened Oct 13, 2025 by Jialin
3 of 5 tasks
[DO NOT MERGE] test inductor partitioning w/ patch
Labels: ci/build, documentation, llama, needs-rebase, rocm
#26741 opened Oct 13, 2025 by angelayi Draft
[Log] Optimize Startup Log
Labels: ready, v1
#26740 opened Oct 13, 2025 by yewentao256
[DO NOT MERGE] 2.9, Inductor partition unit tests, monkeypatch fix
Labels: ci/build, documentation, llama, needs-rebase, ready, rocm
#26738 opened Oct 13, 2025 by ProExpertProg
[Core] Streamline some structured output related code
Labels: structured-output, tpu, v1
#26737 opened Oct 13, 2025 by njhill
[UX] Replace VLLM_ALL2ALL_BACKEND with --all2all-backend
Labels: documentation, moe, ready, v1
#26732 opened Oct 13, 2025 by mgoin
5 tasks
Enable Blackwell Llama4 MoE tests
Labels: llama
#26731 opened Oct 13, 2025 by mgoin Draft
5 tasks
granite-4.0-h: fix prefix naming and add AWQ compatibility
#26730 opened Oct 13, 2025 by toncao
4 tasks
[Bugfix] Fix gpt-oss w4a8 DP/EP on B200
Labels: gpt-oss
#26729 opened Oct 13, 2025 by varun-sundar-rabindranath
[ROCm] enable some tests in entrypoints test groups on AMD
Labels: ci/build, rocm
#26725 opened Oct 13, 2025 by Concurrensee
Fix lora tests failure in TPU CI due to the removal of LoRA bias
Labels: ready, tpu, v1
#26723 opened Oct 13, 2025 by vanbasten23
5 tasks
[CI] Raise VLLM_MAX_SIZE_MB to 500 due to failing Build wheel - CUDA 12.9
Labels: ci/build, ready
#26722 opened Oct 13, 2025 by mgoin
5 tasks
[Doc] Fix macOS installation dependency resolution issue
Labels: documentation
#26721 opened Oct 13, 2025 by shahfasal
3 of 5 tasks
Adding the test-amd.yaml for test definitions for the AMD backend. (alternative PR)
Labels: ci/build, ready, rocm
#26718 opened Oct 13, 2025 by Alexei-V-Ivanov-AMD
[Model] Always use Transformers backend for PaliGemma and Gemma3-MM
Labels: documentation, multi-modality, new-model, rocm
#26715 opened Oct 13, 2025 by DarkLight1337
5 tasks