Pull requests: vllm-project/vllm
[Config] Remove Unused Environment Variable VLLM_DISABLE_PAD_FOR_CUDAGRAPH
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#26743
opened Oct 13, 2025 by
yewentao256
[Easy] Fix env type check errors from VLLM_DEBUG_LOG_API_SERVER_RESPONSE
#26742
opened Oct 13, 2025 by
Jialin
3 of 5 tasks
[DO NOT MERGE] test inductor partitioning w/ patch
ci/build
documentation
Improvements or additions to documentation
llama
Related to Llama models
needs-rebase
rocm
Related to AMD ROCm
[Log] Optimize Startup Log
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#26740
opened Oct 13, 2025 by
yewentao256
[torch.compile] Unwrap fused_marlin_moe custom op
#26739
opened Oct 13, 2025 by
varun-sundar-rabindranath
[DO NOT MERGE] 2.9, Inductor partition unit tests, monkeypatch fix
ci/build
documentation
Improvements or additions to documentation
llama
Related to Llama models
needs-rebase
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
#26738
opened Oct 13, 2025 by
ProExpertProg
[Core] Streamline some structured output related code
structured-output
tpu
Related to Google TPUs
v1
#26737
opened Oct 13, 2025 by
njhill
[Minor] Group async_scheduling related fields in model runner init
v1
#26736
opened Oct 13, 2025 by
njhill
[P/D] Dynamic kv_output_aggregator collect size
kv-connector
v1
#26734
opened Oct 13, 2025 by
NickLucche
[Tests] Varun/marlin experts mk tests
#26733
opened Oct 13, 2025 by
varun-sundar-rabindranath
Draft
[UX] Replace VLLM_ALL2ALL_BACKEND with --all2all-backend
documentation
Improvements or additions to documentation
moe
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#26732
opened Oct 13, 2025 by
mgoin
5 tasks
granite-4.0-h: fix prefix naming and add AWQ compatibility
#26730
opened Oct 13, 2025 by
toncao
4 tasks
[Bugfix] Fix gpt-oss w4a8 DP/EP on B200
gpt-oss
Related to GPT-OSS models
#26729
opened Oct 13, 2025 by
varun-sundar-rabindranath
[Feature] Change vllm.py with pydantic validation
kv-connector
v1
#26726
opened Oct 13, 2025 by
VladOS95-cyber
5 tasks
[ROCm] enable some tests in entrypoints test groups on AMD
ci/build
rocm
Related to AMD ROCm
#26725
opened Oct 13, 2025 by
Concurrensee
Fix lora tests failure in TPU CI due to the removal of LoRA bias
ready
ONLY add when PR is ready to merge/full CI is needed
tpu
Related to Google TPUs
v1
#26723
opened Oct 13, 2025 by
vanbasten23
5 tasks
[CI] Raise VLLM_MAX_SIZE_MB to 500 due to failing Build wheel - CUDA 12.9
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
#26722
opened Oct 13, 2025 by
mgoin
5 tasks
[Doc] Fix macOS installation dependency resolution issue
documentation
Improvements or additions to documentation
#26721
opened Oct 13, 2025 by
shahfasal
3 of 5 tasks
[ROCm][Tools] Add environment variable tuning package for optimized defaults
rocm
Related to AMD ROCm
#26719
opened Oct 13, 2025 by
AndreasKaratzas
Adding the test-amd.yaml for test definitions for the AMD backend. (alternative PR)
ci/build
ready
ONLY add when PR is ready to merge/full CI is needed
rocm
Related to AMD ROCm
#26718
opened Oct 13, 2025 by
Alexei-V-Ivanov-AMD
[Kernel][Performance] Fuse float cast and renormalize to topk softmax kernel
#26717
opened Oct 13, 2025 by
izhuhaoran
5 tasks
[Model] Always use Transformers backend for PaliGemma and Gemma3-MM
documentation
Improvements or additions to documentation
multi-modality
Related to multi-modality (#4194)
new-model
Requests to new models
rocm
Related to AMD ROCm
#26715
opened Oct 13, 2025 by
DarkLight1337
5 tasks