feat: support early exit (#459) #481

Forsworns · 2025-10-27T02:22:29Z

Description

Add two bpf helper functions for CUDA.
Add an early-exit demo in CUDA examples.
Fixes [FEATURE] CUDA kernel early exit demo #459

Type of change

New feature (non-breaking change which adds functionality)

How Has This Been Tested?

Tested via the provided demo.

Forsworns · 2025-10-27T11:28:16Z

I just found there were some typos in the README and the binary was included.

yunwei37 · 2025-10-27T12:25:27Z

Thanks a lot!

Copilot

Pull Request Overview

This PR adds early exit functionality for CUDA kernels through eBPF helpers, enabling kernel atomization capabilities similar to network packet filtering. It introduces two new BPF helper functions (bpf_cuda_exit and bpf_get_grid_dim) and provides a complete demonstration through a vector addition example with partition-based execution control.

Key changes:

Two new BPF helper functions (507: exit, 508: get_grid_dim) for CUDA kernel control
Complete atomizer example with partition-based block filtering
PTX-level early exit implementation via inline assembly

Reviewed Changes

Copilot reviewed 11 out of 12 changed files in this pull request and generated 6 comments.

Show a summary per file

File	Description
attach/nv_attach_impl/trampoline/default_trampoline.cu	Implements the two new BPF helper functions in CUDA
attach/nv_attach_impl/trampoline_ptx.h	Adds PTX assembly for the new helper functions
attach/nv_attach_impl/nv_attach_impl_patcher.cpp	Registers the new helper functions (507 and 508)
example/gpu/atomizer/atomizer.bpf.c	eBPF program implementing partition-based kernel atomization
example/gpu/atomizer/atomizer.c	Userspace loader for the eBPF atomizer program
example/gpu/atomizer/vec_add.cu	CUDA vector addition demo application
example/gpu/atomizer/main.ptx	Generated PTX assembly from the demo
example/gpu/atomizer/filter_hashtag.py	Utility script to filter preprocessor directives
example/gpu/atomizer/README.md	Documentation for the atomizer example
example/gpu/atomizer/Makefile	Build configuration for the atomizer example
example/gpu/atomizer/.gitignore	Git ignore rules for build artifacts

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

example/gpu/atomizer/vec_add.cu

example/gpu/atomizer/README.md

Forsworns · 2025-10-27T13:54:03Z

I have addressed typos. But I found another problem:

when I set the launching configuration in vector_add.cu to vectorAdd<<<10, 1>>>(d_A, d_B, d_C);,
and try to read the pre-configured partition number/index from the BPF maps in the atomizer.bpf.c, it sometimes returns null pointer for the given key. I'm not sure where the problem is.

Currently, I only launch a single block in the vector_add.c and it works well. But then only part of threads are exited, instead of the whole thread blocks. Thus, the semantic is different from the LithOS. :(

yunwei37 · 2025-10-30T14:12:18Z

I think we need to provide new attach types instead of function probes to support that semantic. That's not hard as we have seperated the attach types into passes?

yunwei37 · 2025-10-30T14:12:58Z

Maybe we can merge that and continue future work on next PR? @Officeyutong @Sy0307 @Forsworns

Forsworns · 2025-10-31T10:26:54Z

Maybe we can merge that and continue future work on next PR? @Officeyutong @Sy0307 @Forsworns

I'm fine about the attach type. But I'm still bothered by the above BPF map issue, do you have any ideas?

- Add two bpf helper functions for CUDA. - Add an early-exit demo in CUDA examples. Close eunomia-bpf#459 Signed-off-by: Forsworns <[email protected]>

Sy0307 · 2025-10-31T10:56:37Z

Maybe I can review it later for BPF map issue? Can you give a more detailed description?

Forsworns · 2025-11-01T03:27:42Z

Maybe I can review it later for BPF map issue? Can you give a more detailed description?

@Sy0307 I just opened a new issue in #486 and provide two examples. I guess it is related to the synchronize between host and device.

pull-request-size bot added the size/XXL label Oct 27, 2025

Officeyutong approved these changes Oct 27, 2025

View reviewed changes

yunwei37 requested a review from Copilot October 27, 2025 12:25

Copilot AI reviewed Oct 27, 2025

View reviewed changes

Forsworns force-pushed the atomizer branch 2 times, most recently from 0aa403c to e9d2be3 Compare October 27, 2025 13:45

feat: support early exit (eunomia-bpf#459)

d81be72

- Add two bpf helper functions for CUDA. - Add an early-exit demo in CUDA examples. Close eunomia-bpf#459 Signed-off-by: Forsworns <[email protected]>

Forsworns force-pushed the atomizer branch from e9d2be3 to d81be72 Compare October 31, 2025 10:27

github-actions bot mentioned this pull request Nov 1, 2025

Monthly Org Report (2025-10-01..2025-10-31) eunomia-bpf/eunomia.dev#50

Open

Uh oh!

feat: support early exit (#459) #481

Are you sure you want to change the base?

feat: support early exit (#459) #481

Uh oh!

Conversation

Forsworns commented Oct 27, 2025

Description

Type of change

How Has This Been Tested?

Uh oh!

Forsworns commented Oct 27, 2025

Uh oh!

yunwei37 commented Oct 27, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Forsworns commented Oct 27, 2025

Uh oh!

yunwei37 commented Oct 30, 2025

Uh oh!

yunwei37 commented Oct 30, 2025

Uh oh!

Forsworns commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Sy0307 commented Oct 31, 2025

Uh oh!

Forsworns commented Nov 1, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Forsworns commented Oct 31, 2025 •

edited

Loading