Pull requests: vllm-project/vllm

Pull requests list

[Bugfix] when nixi port by bind, process cannot stop
#23756 opened Aug 27, 2025 by lengrongfu
5 tasks
Support for NemotronH Nano VLM with an optimized vision model (vLLM native)
Labels: multi-modality, new-model
#23753 opened Aug 27, 2025 by danielafrimi Draft
[Doc]: Spelling errors fixed in .md files
Labels: ci/build, documentation, needs-rebase, performance
#23751 opened Aug 27, 2025 by didier-durand
1 task done
[Model] Merge SupportsMultiModalWithRawInput with SupportsMultiModal
Labels: multi-modality, new-model, ready, v1
#23749 opened Aug 27, 2025 by DarkLight1337
5 tasks
Tune configs for triton block fp8 gemm H100/H200
Labels: performance
#23748 opened Aug 27, 2025 by mgoin
5 tasks
[Docs] Fix warnings in mkdocs build (continued)
Labels: ready, structured-output, tpu, v1
#23743 opened Aug 27, 2025 by Zerohertz
Adapting Qwen3-32B to Eagle3 mode to resolve head dimension mismatch issues
Labels: qwen, v1
#23740 opened Aug 27, 2025 by coder-fny
5 tasks
Disable torch.compile for dynamic rope models in Transformers backend
Labels: documentation, new-model, ready
#23738 opened Aug 27, 2025 by hmellor
[Misc] Extract common utils for nvfp4 kernel source files
#23727 opened Aug 27, 2025 by elvischenv
5 tasks
[Misc] Removed force_fp8_e4m3fnuz from FP8LinearOp
#23725 opened Aug 27, 2025 by nvjullin
5 tasks
[Frontend] Pass API server count to each process
Labels: documentation, frontend, multi-modality, ready, v1
#23717 opened Aug 27, 2025 by DarkLight1337
1 of 5 tasks
[ROCm][FEAT] Integrate AITER tgemm.
Labels: needs-rebase, rocm
#23712 opened Aug 27, 2025 by vllmellm Draft
5 tasks
[Bugfix] when set offline model running error
Labels: frontend
#23711 opened Aug 27, 2025 by lengrongfu
5 tasks
[WIP] Adding int4 models for CPU benchmarking
Labels: ci/build, performance
#23709 opened Aug 27, 2025 by louie-tsai
5 tasks