{{ message }}
Check pointer alignment when copying from strided array to C-contiguous array#1890
Merged
Conversation
Collaborator
Author
|
Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. 🤞 |
|
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_198 ran successfully. |
6df2811 to
cdf8176
Compare
|
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_198 ran successfully. |
Collaborator
Also only enforce alignment on dst pointer
|
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_200 ran successfully. |
8 tasks
1. Save common subexpressions to variables 2. Sub-group size type changed to uint16 (from uint32) 3. sg.get_local_range() replaced with sg.get_max_local_range() This is safe to do since work-group size is chosen to be a multiple of sub-group size for all possile choices of sub-group size (1, 8, 16, 32, 64) 4. Simplified computation of base value in generic branch for complex types, or when sg_load is disabled, to avoid a division (and left a comment)
Also reordered template parameters vec_sz, n_vecs for consistency with the wide code-base.
Contribution to fix gh 1887
|
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_203 ran successfully. |
Contributor
|
The relevant tests for |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.

This PR resolves #1887
When using sub-group loads and stores, certain alignment of pointers is required. Copies to C-contiguous memory were not properly checking alignment, which would lead to incorrect results.
Before, using the example in #1887:
with this change: