Pull requests: vllm-project/vllm-gaudi
Fix warmup break when max decode bucket bs > max num seq
#107 opened Aug 26, 2025 by taran2210
Enable Spec Decode for HPU v1 - Part1(basic workflow + eagle)
#81 opened Aug 15, 2025 by xuechendi
Enable LMCache for cpuoffloading, LMCache docker support, enable lmcache
#64 opened Aug 6, 2025 by hsubramony
Add graph compilation tracking to high level profiler
#50 opened Jul 28, 2025 by kzawora-intel