Skip to content

Conversation

@mark14wu
Copy link
Collaborator

@mark14wu mark14wu commented Nov 7, 2025

Enhance profiler to show detailed mask usage for individual operations:

  • Track total/false elements per load/store operation
  • Display top 5 operations by false elements with line numbers
  • Extract and show complete multi-line source code statements
  • Add aggregated summary table sorted by mask inefficiency

New utility functions in clients/utils.py:

  • extract_complete_statement_from_line(): Parse multi-line Python statements
  • get_source_location_from_stack(): Extract user code location from call stack

Example output:

    Total elements: 67,108,864
    False elements: 29,360,128 (43.8%)
    Code:
        src_data = tl.load(...)
image

Enhance profiler to show detailed mask usage for individual operations:
- Track total/false elements per load/store operation
- Display top 5 operations by false elements with line numbers
- Extract and show complete multi-line source code statements
- Add aggregated summary table sorted by mask inefficiency

New utility functions in `clients/utils.py`:
- `extract_complete_statement_from_line()`: Parse multi-line Python statements
- `get_source_location_from_stack()`: Extract user code location from call stack

Example output:
```
    Total elements: 67,108,864
    False elements: 29,360,128 (43.8%)
    Code:
        src_data = tl.load(...)
```
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants