Get counts of one numpy array using another array as what to count based on

April 6, 2023

I have the following code, which uses np.bincount to count occurrences:

categories = df[attribute].cat.categories
print(categories)
>>> Int64Index([0, 1, 2], dtype='int64')
print(df[attribute].to_numpy())
>>> [0 1 0 1 1]
partition = np.bincount(df[attribute].to_numpy())
print(partition)
>>> [2 3]

What I want is for the counting to use bins based on the categories array, so that the result would be [2 3 0], because there are no 2's in the array. Is there any way to do this? My dataframes are always set up so that categorical data types are integer encoded, starting from 0 and going up to the number of classes. I want to avoid using df[attribute].value_counts() because profiling suggests it is a bottleneck, though I'm not entirely sure.

Solution:

You can use np.unique with return_counts=True:

import numpy as np
import pandas as pd

df = pd.DataFrame({'attribute': [0, 0, 1, 1, 1]})
df = df.astype({'attribute': pd.CategoricalDtype([0, 1, 2])})

cat, count = np.unique(df['attribute'], return_counts=True)

Output:

>>> cat, count
(array([0, 1]), array([2, 3]))

As suggested by @jezrael, to get your expected output you can reindex the counts on the full list of categories, filling the missing ones with 0:

>>> pd.Series(count, index=cat).reindex(df['attribute'].cat.categories, fill_value=0)
0    2
1    3
2    0
dtype: int64

However, you should benchmark this against value_counts, which already includes the unused categories:

>>> df['attribute'].value_counts(sort=False)
0    2
1    3
2    0
Name: attribute, dtype: int64
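
Since the question already uses np.bincount, another option worth timing is to stay with it and pass minlength, which pads the result with zeros up to the number of categories. This is not part of the answer above, just a minimal sketch. It assumes the column has a categorical dtype, and category_counts is a hypothetical helper name. It counts the integer codes from .cat.codes rather than the raw values, so it does not depend on the categories themselves being 0..n-1.

```python
import timeit

import numpy as np
import pandas as pd


def category_counts(series):
    """Count occurrences per category, including categories that never occur."""
    codes = series.cat.codes.to_numpy()
    # Missing values are encoded as -1, which np.bincount rejects, so drop them.
    codes = codes[codes >= 0]
    # minlength pads the result with zeros for categories absent from the data.
    return np.bincount(codes, minlength=len(series.cat.categories))


df = pd.DataFrame({'attribute': [0, 0, 1, 1, 1]})
df = df.astype({'attribute': pd.CategoricalDtype([0, 1, 2])})

counts = category_counts(df['attribute'])
print(counts)  # [2 3 0]

# Rough timing comparison; the numbers depend on data size and library versions.
n = 1000
t_bincount = timeit.timeit(lambda: category_counts(df['attribute']), number=n)
t_value_counts = timeit.timeit(
    lambda: df['attribute'].value_counts(sort=False), number=n
)
print(f"bincount: {t_bincount:.4f}s, value_counts: {t_value_counts:.4f}s")
```

The result is a plain NumPy array ordered like df['attribute'].cat.categories, so position i holds the count for the i-th category. Rerun the timing on data of a realistic size before concluding which approach is faster.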

numpy

byMR

Published April 06, 2023
