[Numpy-discussion] A minor clarification on why count_nonzero is faster for boolean arrays

Discussion:

Raghav R V

2015-12-17 17:52:15 UTC

I was just playing with `count_nonzero` and found it to be significantly
faster for boolean arrays than for integer arrays:

import numpy as np
a = np.random.randint(0, 2, (100, 5))
a_bool = a.astype(bool)
%timeit np.sum(a)

100000 loops, best of 3: 5.64 µs per loop

%timeit np.count_nonzero(a)

1000000 loops, best of 3: 1.42 µs per loop

%timeit np.count_nonzero(a_bool)

1000000 loops, best of 3: 279 ns per loop (but why?)
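
For readers without IPython, the same comparison can be sketched with the standard-library `timeit` module. The helper name `best_time` and the loop counts are assumptions chosen for illustration, and absolute numbers will differ by machine and NumPy version.

```python
import timeit

import numpy as np

a = np.random.randint(0, 2, (100, 5))
a_bool = a.astype(bool)


def best_time(func, arg, number=10000, repeat=3):
    """Return the best per-call time in seconds over `repeat` runs."""
    timer = timeit.Timer(lambda: func(arg))
    return min(timer.repeat(repeat=repeat, number=number)) / number


for label, func, arg in [
    ("np.sum(a)", np.sum, a),
    ("np.count_nonzero(a)", np.count_nonzero, a),
    ("np.count_nonzero(a_bool)", np.count_nonzero, a_bool),
]:
    # Report nanoseconds per call so the three cases are easy to compare.
    print(f"{label:28s} {best_time(func, arg) * 1e9:10.0f} ns per call")
```

All three calls return the same count here because `a` only contains 0 and 1; only the time taken differs.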

I tried looking into the code and dug my way through to this line
<https://github.com/numpy/numpy/blob/c0e48cfbbdef9cca954b0c4edd0052e1ec8a30aa/numpy/core/src/multiarray/item_selection.c#L2172>.
I am unable to dig further.

I know this is probably a trivial question, but I was wondering whether
anyone could explain why this is so.

Thanks

R

CJ Carey

2015-12-17 18:37:56 UTC

I believe this line is the reason:
https://github.com/numpy/numpy/blob/c0e48cfbbdef9cca954b0c4edd0052e1ec8a30aa/numpy/core/src/multiarray/item_selection.c#L2110

_______________________________________________
NumPy-Discussion mailing list
https://mail.scipy.org/mailman/listinfo/numpy-discussion

Benjamin Root

2015-12-17 18:44:40 UTC

Would it make sense at all to bring that optimization to np.sum()? I
know that I have np.sum() all over the place instead of count_nonzero,
partly because it is a MATLAB-ism and partly because it is easier to write.
I had no clue that there was a performance difference.
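
As a small illustration of the habit described above, the sketch below counts the `True` entries of a boolean mask both ways. The array contents and variable names are made up for the example; the point is only that the two calls agree on boolean input and diverge on general integers.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.random(1000)
mask = x > 0.5  # boolean array

# On a boolean mask both calls return the number of True entries.
n_sum = int(np.sum(mask))
n_cnz = int(np.count_nonzero(mask))
assert n_sum == n_cnz

# On non-boolean data they answer different questions.
v = np.array([0, 2, 3])
total = int(np.sum(v))              # adds the values
nonzero = int(np.count_nonzero(v))  # counts the nonzero entries
print(n_sum, n_cnz, total, nonzero)
```

So swapping `np.sum(mask)` for `np.count_nonzero(mask)` is only a drop-in change when the argument really is a boolean mask.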

Cheers!
Ben Root

Jaime Fernández del Río

2015-12-17 22:02:04 UTC

Post by CJ Carey
https://github.com/numpy/numpy/blob/c0e48cfbbdef9cca954b0c4edd0052e1ec8a30aa/numpy/core/src/multiarray/item_selection.c#L2110

The magic actually happens in count_nonzero_bytes_384, a few lines before
that (line 1986).
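
The name suggests blocks of 384 bits, that is 48 bytes or six 64-bit words. Assuming every boolean is stored as a byte equal to 0 or 1, such a block can be counted with a few word additions and one multiplication instead of 48 per-element tests. The pure-Python sketch below mimics that idea; the function name `count_true_bytes_48` is hypothetical, and this is an illustration of the trick, not NumPy's C code.

```python
import numpy as np

MASK64 = (1 << 64) - 1  # emulate 64-bit unsigned wraparound


def count_true_bytes_48(block: bytes) -> int:
    """Count nonzero bytes in a 48-byte block whose bytes are all 0 or 1."""
    if len(block) != 48:
        raise ValueError("expected exactly 48 bytes")
    # Split the block into six 64-bit words.
    words = [int.from_bytes(block[i:i + 8], "little") for i in range(0, 48, 8)]
    # Adding the words adds the eight byte lanes in parallel. Each lane
    # holds at most 6 afterwards, so no carry crosses a lane boundary.
    lanes = sum(words)
    # Multiplying by 0x0101010101010101 accumulates all eight lanes into
    # the top byte (at most 48, so it fits); shift it down to read it.
    return ((lanes * 0x0101010101010101) & MASK64) >> 56


a_bool = np.random.randint(0, 2, 48).astype(bool)
assert count_true_bytes_48(a_bool.tobytes()) == np.count_nonzero(a_bool)
```

The shortcut only holds when every byte is 0 or 1. An integer array gives no such guarantee, so each element has to be compared against zero individually, which fits the slower timings reported for `a`.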

Jaime

--
(\__/)
( O.o)
( > <) This is Bunny. Copy Bunny into your signature and help him with his plans
for world domination.

Raghav R V

2015-12-17 23:13:15 UTC

Thanks a lot everyone!

I am time and again amazed by how optimized numpy is! Hats off to you guys!

R
