Skip to content

Conversation

@eexwhyzee
Copy link

@eexwhyzee eexwhyzee commented Dec 3, 2025

resolves: #504

this PR implements additional eBPF helpers funcs for get_sm_id, get_warp_id, and get_lane_id and adds a new GPU example that demonstrates how to trace thread (sm/warp/lane) scheduling using these eBPF helpers.

Testing

tested locally on RTX 3090 (steps to run the example documented in the README):
Screenshot 2025-12-03 at 5 49 30 PM

sidenote: during development with the 3090, i had to explicitly set the env var BPFTIME_SM_ARCH=sm_86 to be able to run bpftime (see: #516), i left this detail out of the README since it looks like SM_ARCH auto-detect will be handled by #524

@eexwhyzee eexwhyzee marked this pull request as draft December 3, 2025 07:06
@eexwhyzee eexwhyzee marked this pull request as ready for review December 4, 2025 04:32
@yunwei37
Copy link
Member

yunwei37 commented Dec 4, 2025

Thanks!
Could you sign off your commits?

@eexwhyzee eexwhyzee force-pushed the cuda-thread-mapping branch from dddd15a to 20415a8 Compare December 4, 2025 05:09
@eexwhyzee
Copy link
Author

@yunwei37 yup yup, just pushed the amended commits with the sign offs

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[FEATURE] Add SM / warp / lane mapping example for bpftime GPU tracing

2 participants