Skip to content

Commit 5c4e03f

Browse files
noemotiovonrasmith
authored andcommitted
[Misc]Minor Changes about Worker (vllm-project#11555)
Signed-off-by: Chenguang Li <[email protected]>
1 parent d372804 commit 5c4e03f

File tree

2 files changed

+0
-2
lines changed

2 files changed

+0
-2
lines changed

vllm/v1/worker/gpu_worker.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -132,7 +132,6 @@ def determine_num_available_blocks(self) -> Tuple[int, int]:
132132
# Execute a forward pass with dummy inputs to profile the memory usage
133133
# of the model.
134134
self.model_runner.profile_run()
135-
torch.cuda.synchronize()
136135

137136
free_gpu_memory, _ = torch.cuda.mem_get_info()
138137
# NOTE(woosuk): Here we assume that the other processes using the same

vllm/worker/worker.py

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -200,7 +200,6 @@ def determine_num_available_blocks(self) -> Tuple[int, int]:
200200
weights_memory_in_bytes=self.model_runner.
201201
model_memory_usage) as result:
202202
self.model_runner.profile_run()
203-
torch.cuda.synchronize()
204203

205204
self._assert_memory_footprint_increased_during_profiling()
206205

0 commit comments

Comments
 (0)