iframe-proxy

mattip · 2019-02-13T17:32:49Z

Adds an 'order' kwarg to packbits, unpackbits. Should this hit the mailing list? It is a small enhancement. I removed the separate handling of NPY_BYTE_ORDER, we only have little endian systems to test on, but it shouldn't matter for uint8 processing.

mattip · 2019-02-13T17:33:28Z

There probably is a more elegant way to flip the bits

I think you need to rewrite the loop about, and not do build <<= 1, but rather OR-in an already shifted element (i.e., something like build |= (inputr[j] != 0) << shift where shift depends on order and j).

You mention "adopted" but I think the code is still the same, no? I meant the loop above (not "about"), which now has the build <<= 1. I think that loop can be rewritten to just do the right thing for each order.

I could write this as below, is that better? I don't understand why I needed to use unsigned char instead of char for the order != 'b' case, I am only ever shifting right. If unsigned char is required it should be for the left shifts, no?

Details

for (; index < n_out; index++) { unsigned char build = 0; int i, maxi; npy_intp j; maxi = (index == n_out - 1) ? remain : 8; if (order == 'b') { for (i = 0; i < maxi; i++) { build <<= 1; for (j = 0; j < element_size; j++) { build |= (inptr[j] != 0); } inptr += in_stride; } if (index == n_out - 1) { build <<= 8 - remain; } } else { for (i = 0; i < maxi; i++) { build >>= 1; for (j = 0; j < element_size; j++) { build |= (inptr[j] != 0) ? 128 : 0; } inptr += in_stride; } if (index == n_out - 1) { build >>= 8 - remain; } } *outptr = (char)build; outptr += out_stride; }

mattip · 2019-02-13T17:34:21Z

I think this was a bug, if not we need to add it back in. I don't have a big-endian system to test on.

Hmm, this does get turned into bytes in the end, and those will have a different order in the uint64 here. So, my guess is that you do need to swap (i.e., have the reverse swap for big and little endian). But am a bit too lazy to just write it out... Note that the test cases surely would fail for big endian if this was wrong...

Now tested on a big-endian system since I have access to the gcc build farm and a sparc64 machine. See 5d1b9208d

mattip · 2019-02-25T14:57:46Z

Waiting for #10855 to land, then this will need updating

mattip · 2019-02-26T14:59:41Z

Tests pass, no extra modifications were needed for count

mhvk

I like the idea. Some initial comments.

mhvk · 2019-02-26T19:36:53Z

Flip around for packbits [0, 0, 0, 0, 0, 1, 1] => 3

mhvk · 2019-02-26T19:42:50Z

I think you need to rewrite the loop about, and not do build <<= 1, but rather OR-in an already shifted element (i.e., something like build |= (inputr[j] != 0) << shift where shift depends on order and j).

mhvk · 2019-02-26T19:45:01Z

Put this in an else clause of the if statement below.

mhvk · 2019-02-26T19:49:08Z

I'm confused: exactly the same v is constructed here as below, the only difference being the byte swap. Can we just fill both big and little in one loop? (one the unswapped, the other the swapped v)?

mhvk · 2019-02-26T19:50:48Z

Hmm, this does get turned into bytes in the end, and those will have a different order in the uint64 here. So, my guess is that you do need to swap (i.e., have the reverse swap for big and little endian). But am a bit too lazy to just write it out... Note that the test cases surely would fail for big endian if this was wrong...

mattip · 2019-03-03T21:17:49Z

Following this guide, I set up a mips big-endian qemu machine, translated numpy, and ran the tests in test_packbits.py. They passed.

mhvk

Two comments about comments, otherwise just a question about the initialization.

mhvk · 2019-03-10T17:24:44Z

Adjust the comment to make clear this is for the "big-endian" case.

removed reference to endian, since the 'b' makes the intention clear. Moved the rest of the comment to mark the intrinsic

mhvk · 2019-03-10T17:27:57Z

You mention "adopted" but I think the code is still the same, no? I meant the loop above (not "about"), which now has the build <<= 1. I think that loop can be rewritten to just do the right thing for each order.

mhvk · 2019-03-10T17:29:06Z

Good to adjust this comment too...

"fixed" by removing "big endian"

eric-wieser · 2019-03-10T21:37:15Z

Can we match python here in int.from_bytes (C longobject.c:int_to_bytes_impl), and actually require the full string?

Could consider using the internal _PyUnicode_EqualToASCIIId, which I think is available in the python versions we care about.

I used strncmp instead, which does not require Py_LIMITED_API

We already depend on Py_LIMITED_API, so I don't think we need to care

one of the test runners failed when I used _PyUnicode_EqualToASCIIString, saying the function was not defined.

Edit: fix function name

eric-wieser · 2019-03-10T21:46:14Z

There's no guarantee that char(256) overflows, CHAR_BIT might not be 8.

Why not just use a simple loop here?

for (int i = 0; i < 8; i++) { *outptr = (bool)(*inptr & (1 << i)); outptr += out_stride; }

eric-wieser · 2019-03-10T22:19:46Z

Shouldn't there be 8 items in this list, not 7?

I'm not sure I'd call this the common standard - I frankly find it bizarre that unpackbits(1)[0] does not refer to "bit 0" of x.

Could instead compare to binary literal syntax, stating that 0b00000011 => [0, 0, 0, 0, 0, 0, 1, 1]

Seeing the comment, I remember being bitten by exactly this (which is of course why "little" will be useful!).

Maybe the docstring can be agnostic about which is "standard" - both have advantages. I like the idea of adding 3 = 0b00000011 => [...]

mattip · 2019-03-16T17:03:24Z

@eric-wieser ping

mattip · 2019-03-28T06:58:47Z

@eric-wieser ping

charris · 2019-05-11T16:33:33Z

The order refers to the list of bits in the input.

Rename to bitorder?

redid the comment and changed order -> bitorder

charris · 2019-05-11T18:57:57Z

Should be

bitorder : {'big', 'little'}, optional

Also below.

charris · 2019-05-11T22:26:05Z

Periods, Matti, periods.

charris · 2019-05-12T03:02:58Z

close/reopen

charris · 2019-05-12T03:50:07Z

+`numpy.packbits` and `numpy.unpackbits` accept an ``order`` keyword
+-------------------------------------------------------------------
+The ``order`` keyword defaults to ``big``, and will order the **bits**
+accordingly. For ``'big'`` 3 will become ``[0, 0, 0, 0, 0, 0, 1, 1]``, and


Period. But I will fix it when editing the release note.

charris · 2019-05-12T03:51:40Z

Thanks Matti.

juliantaylor · 2019-05-12T12:34:18Z

    }

-    /* setup lookup table under GIL, big endian 0..256 as bytes */
-    if (unpack_init == 0) {


how does the removal of the lookup table impact performance?
As I recall it was significant when I added it.

juliantaylor · 2019-05-12T12:46:24Z

       before           after         ratio
     [08b17aee]       [e6227a03]
     <v1.16.3^0>       <master>  
+      5.80±0.5μs         46.4±1μs     8.00  bench_core.UnpackBits.time_unpackbits
+         120±2μs         911±20μs     7.58  bench_core.UnpackBits.time_unpackbits_axis1

I realize the lookup table is a bunch of extra code, but it is a factor 8 speedup.
this is pretty significant, it should at least be mentioned in the release notes.

juliantaylor · 2019-05-12T13:47:09Z

mattip added 01 - Enhancement component: numpy._core labels Feb 13, 2019

mattip commented Feb 13, 2019

View reviewed changes

charris changed the title ~~ENH: add 'order' keyword to packbits, unpackbits~~ ENH: Add 'order' keyword to packbits, unpackbits Feb 19, 2019

mattip force-pushed the unpackbits branch from fcc8ab0 to 894c67b Compare February 26, 2019 10:33

mhvk reviewed Feb 26, 2019

View reviewed changes

mattip force-pushed the unpackbits branch from 4f84630 to ea572f1 Compare February 27, 2019 22:27

eric-wieser self-requested a review March 10, 2019 17:21

mhvk reviewed Mar 10, 2019

View reviewed changes