-
Notifications
You must be signed in to change notification settings - Fork 1.8k
Open
Labels
AutoDeploy<NV> AutoDeploy Backend<NV> AutoDeploy Backend
Description
🚀 The feature, motivation and pitch
capture nsys; compare to vLLM and identify issues.
- Create and share a perf table for various configuration points (same as H100 + 8K/16K)
- Share instructions on how to setup vLLM docker on B200
- Share traces
- Find root causes of bad AD perf, if any.
Alternatives
No response
Additional context
capture nsys; compare to vLLM and identify issues.
Before submitting a new issue...
- Make sure you already searched for relevant issues, and checked the documentation and examples for answers to frequently asked questions.
Metadata
Metadata
Assignees
Labels
AutoDeploy<NV> AutoDeploy Backend<NV> AutoDeploy Backend
Type
Projects
Status
In progress