{{ message }}
Fix/5614 grouped conv cpu lowering#6710
Open
alisterburt wants to merge 2 commits into
Open
Conversation
Member
Author
|
@BradLarson thanks for the tip! I had to work around a few little issues to run the test via bazel on my mac but have now confirmed the model successfully compiles in the test after this PR where it previously failed. Note the test is still skioped so wouldn't yet make a regression visible in CI. I considered a separate test but was aiming to keep changes minimal :-) |
Contributor
|
Thanks for helping out! I have an internal PR up now which removes the skip for this test, I'll try to get that out in the next nightly release. Edit: It's been merged |
Contributor
modularbot
pushed a commit
that referenced
this pull request
Jun 24, 2026
reference The conv2d accuracy test was skipped due to an unknown accuracy. This was the reference using TF32 instead of F32. This PR changes the reference to TF32 and enables the test. This is in preparation for #6710. MODULAR_ORIG_COMMIT_REV_ID: 4ffd7bf95d708269cfaf212ac419b34ce3761991
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.

Assisted-by: Claude Opus 4.8
Linked issue
Fixes #5614 (PR opened without pre-approval as seems trivial? happy to close and discuss if preferred)
Type of change
Motivation
MAX models with grouped convs would fail to compile despite the kernels supporting grouped conv
repro:
What changed
Testing
I have not built max against the contents of this PR, I don't think this is currently possible with what has been open sourced?
To prove the fix I wrote a small custom op that does exactly
what the grouped CPU lowering should do —
pack_conv_filter_shape→pack_filter(RSCF→FRSCf) →
ConvDirectNHWC[..., filter_packed=True].run(...)— and called itfrom the graph API. It compiles and matches a NumPy grouped-conv reference:
Checklist
agreed-upon, or this is a trivial fix that does not need prior
approval
smaller PRs where possible (see
pull request sizes)
./bazelw run formatto format my changesAssisted-by:trailer in my commit message or this PR description (seeAI Tool Use Policy)
tried to run ./bazelw run format but hit some git error, I think due to my branch naming convention differing from what's expected?