xe: jit: gemm: improve debugability #3110

rjoursler · 2025-04-16T20:02:45Z

Enables source location forwarding for some simple wrapper functions. This improves debugging by letting developers focus on the higher-level details when stepping through code in a debugger.

rjoursler · 2025-04-16T20:47:45Z

make test
disable test_device_cpu
disable test_device_gpu

petercad · 2025-04-16T23:41:55Z

src/gpu/intel/jit/gemm/generator/pieces/gemm_setup.cxx

+            emul32High(1, dst.ud(), src0, src1, loc);
        else
-            mul(1, dst, src0, src1);
+            mul(1, dst, src0, src1, loc);


I wonder if we should be thinking about a more general mechanism for "scoping" location information in this way, instead of passing around lots of SourceLocation objects. Something like this:

    auto mulHigh = [&](..., SourceLocationScoper scoper = {this}) {
        // rest of code as before
    };

And then SourceLocationScoper is defined something like:

    template <HW hw>
    class SourceLocationScoper {
        SourceLocation loc;
        ngen::BinaryCodeGenerator<hw> *generator = nullptr;

    public:
        explicit SourceLocationScoper(ngen::BinaryCodeGenerator<hw> *g)
            : loc{std::source_location::current()}, generator(g) {
            g->enterLocationScope(loc);
        }
        ~SourceLocationScoper() { generator->exitLocationScope(); }
    };

The new generator methods {enter,exit}LocationScope would set/clear a location override (new variable in the generator class). You'd have a counter as well to properly support nested scopes (only the outermost is honored).
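The scoping idea described above can be sketched against a stand-in generator. This is only an illustration under stated assumptions: `MockGenerator`, the plain `SourceLocation` struct, and the `demoNestedScopes` helper are hypothetical and not part of nGEN, and the location is passed to the scoper explicitly rather than captured automatically. It shows the override plus a depth counter, with only the outermost scope honored.

```cpp
#include <cassert>
#include <vector>

// Hypothetical stand-in for the generator's source location type.
struct SourceLocation {
    const char *file;
    int line;
};

// Hypothetical generator: records the line attributed to each instruction.
class MockGenerator {
    SourceLocation overrideLoc_{"", 0};
    int scopeDepth_ = 0;

public:
    std::vector<int> emittedLines;

    // Set the override only when entering the outermost scope.
    void enterLocationScope(const SourceLocation &loc) {
        if (scopeDepth_++ == 0) overrideLoc_ = loc;
    }

    // Leaving the outermost scope (depth back to 0) disables the override.
    void exitLocationScope() {
        assert(scopeDepth_ > 0);
        --scopeDepth_;
    }

    // An "instruction": uses the override while a scope is active,
    // otherwise the location it was given.
    void mul(const SourceLocation &loc) {
        const SourceLocation &used = (scopeDepth_ > 0) ? overrideLoc_ : loc;
        emittedLines.push_back(used.line);
    }
};

// RAII guard: enters a location scope on construction, exits on destruction.
class SourceLocationScoper {
    MockGenerator *generator_;

public:
    SourceLocationScoper(MockGenerator *g, const SourceLocation &loc)
        : generator_(g) {
        generator_->enterLocationScope(loc);
    }
    ~SourceLocationScoper() { generator_->exitLocationScope(); }

    SourceLocationScoper(const SourceLocationScoper &) = delete;
    SourceLocationScoper &operator=(const SourceLocationScoper &) = delete;
};

// Emits instructions outside, inside, and after nested scopes and
// returns the line recorded for each one.
std::vector<int> demoNestedScopes() {
    MockGenerator g;
    g.mul({"kernel.cpp", 10}); // no scope: own location
    {
        SourceLocationScoper outer(&g, {"caller.cpp", 100});
        g.mul({"kernel.cpp", 20}); // outer scope applies
        {
            SourceLocationScoper inner(&g, {"helper.cpp", 200});
            g.mul({"kernel.cpp", 30}); // nested: outer still wins
        }
        g.mul({"kernel.cpp", 40}); // back in outer scope
    }
    g.mul({"kernel.cpp", 50}); // scope closed: own location again
    return g.emittedLines;
}
```

In this sketch the inner scope never replaces the override, so every instruction emitted between the outermost enter and exit is attributed to line 100.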

You could also use this method internally inside nGEN (e.g. pseudo-instructions, etc.) to avoid lots of passing around of loc objects.

rjoursler · 2025-04-17

I don't quite see what the benefit is here. Is this just supposed to be a performance optimization? For release builds, SourceLocation will be an empty object, so I expect it would mostly be optimized away anyway. If the goal is just to avoid forwarding loc in the source code, I don't see much benefit, as forwarding the same location for a "large" operation seems misguided anyway.

In general, I agree we could use some improvement here, I just haven't been able to come up with a good mechanism. The core problem is that we have multiple source locations that we could reasonably map to each instruction, so I don't see a general mechanism we could use to pick the "right" location as it depends on the use and what is normally being debugged.

petercad · 2025-04-17

It's for better readability and easier programmability, rather than performance. The idea is that you only need to change one line to combine location information:

    auto mulHigh = [&](..., SourceLocationScoper scoper = {this}) {
        emul32High(...); // don't have to pass loc here
        mul(...);        // don't have to pass loc here
    };

instead of modifying every single instruction:

    auto mulHigh = [&](..., SourceLocation loc = {}) {
        emul32High(..., loc); // have to pass loc here
        mul(..., loc);        // have to pass loc here
    };

When there are a lot of nested instructions, it's easy to miss a loc, and it's a lot of code changes to make. Since this pattern appears in lots of places, it'd be nice to simplify.
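To illustrate how a missed loc goes unnoticed with explicit forwarding, here is a small sketch. `Loc`, `Recorder`, and the wrapper functions are hypothetical stand-ins, not nGEN types; line 0 stands for "no caller location supplied".

```cpp
#include <vector>

// Hypothetical location type; 0 means "no caller location supplied".
struct Loc {
    int line;
    Loc(int l = 0) : line(l) {}
};

// Hypothetical generator that records the line attributed to each instruction.
struct Recorder {
    std::vector<int> lines;
    void emul32High(Loc loc = Loc()) { lines.push_back(loc.line); }
    void mul(Loc loc = Loc()) { lines.push_back(loc.line); }
};

// Every instruction receives the caller's location.
void wrapperComplete(Recorder &g, Loc loc) {
    g.emul32High(loc);
    g.mul(loc);
}

// The second call omits loc; it compiles cleanly and falls back to the default.
void wrapperWithMissedLoc(Recorder &g, Loc loc) {
    g.emul32High(loc);
    g.mul(); // loc forgotten here
}

std::vector<int> recordComplete() {
    Recorder g;
    wrapperComplete(g, Loc(42));
    return g.lines;
}

std::vector<int> recordMissed() {
    Recorder g;
    wrapperWithMissedLoc(g, Loc(42));
    return g.lines;
}
```

Because the location parameter is defaulted, the compiler gives no diagnostic for the omitted argument; the second instruction is simply recorded without the caller's line.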

rjoursler requested a review from a team as a code owner April 16, 2025 20:02

github-actions bot added the platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel label Apr 16, 2025

rjoursler added 2 commits April 16, 2025 13:47

xe: jit: gemm: forward line information math_utils

91521dc

xe: jit: gemm: forward simple lambda source location

00a7fca

rjoursler force-pushed the rjoursle/gemm_debug branch from 5e4bfd6 to 00a7fca Compare April 16, 2025 20:47

petercad reviewed Apr 16, 2025
