Using pandas nullable integer dtype in np.where condition

November 24, 2021

I have the DataFrame below, which has some missing values.

import numpy as np
import pandas as pd

df = pd.DataFrame(data=[['A', 1, None], ['B', 2, 5]],
                  columns=['X', 'Y', 'Z'])

Since df['Z'] is supposed to be an integer column, I changed its data type to pandas' new experimental nullable integer type, as below.

df['Z'] = df['Z'].astype(pd.Int32Dtype())
df

    X   Y   Z
0   A   1   <NA>
1   B   2   5

Now I am trying to use a simple np.where call to replace the non-null values in the column df['Z'] with a fixed integer value (say 1), using the code below.

np.where(pd.isna(df['Z']), pd.NA, np.where(df['Z'] > 0, 1, 0))

But I get the following error, and I do not understand why, since the first condition already checks for the rows with null values.

TypeError: boolean value of NA is ambiguous

Solution:

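np.where expects an array of plain booleans. With a NumPy-backed dtype such as float64 (int64 cannot hold missing values at all), using > on the Series returns False for NaN. With the nullable Int32 dtype (note the capital I), > does not coerce missing values to False: the result is a nullable boolean Series that still contains <NA>. The inner np.where is evaluated in full before the outer one applies the pd.isna check, so it has to decide whether <NA> is true or false, hence the error.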
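The difference can be seen in a small sketch. The Series names z_float and z_nullable are made up for illustration and are built directly rather than taken from the question's DataFrame:

```python
import numpy as np
import pandas as pd

# Illustrative stand-ins for column Z under the two kinds of dtype.
z_float = pd.Series([np.nan, 5.0], name='Z')                 # NumPy-backed float64
z_nullable = pd.Series([pd.NA, 5], dtype='Int32', name='Z')  # nullable integer

# NaN > 0 evaluates to False, so the mask is an ordinary bool Series.
numpy_mask = z_float > 0

# pd.NA > 0 stays <NA>, so the mask has the nullable "boolean" dtype.
nullable_mask = z_nullable > 0

print(numpy_mask.dtype, numpy_mask.tolist())
print(nullable_mask.dtype, nullable_mask.tolist())

# Asking for the truth value of pd.NA is what raises the error.
try:
    bool(pd.NA)
except TypeError as exc:
    message = str(exc)
    print(message)
```

The first mask contains only True and False, while the second still carries a missing entry that NumPy has no way to interpret as a boolean.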

One solution would be to use df['Z'].gt(0).fillna(False) instead of df['Z'] > 0. The two expressions are equivalent, except that the first one replaces <NA> with False:

np.where(pd.isna(df['Z']), pd.NA, np.where(df['Z'].gt(0).fillna(False), 1, 0))
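As a rough check of what this returns, the sketch below runs the fix on a stand-alone Series named z (a hypothetical stand-in for the Z column). It also shows one possible alternative that is not part of the answer above: casting the nullable boolean mask to Int32, which should keep <NA> in place without going through NumPy.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the nullable Z column from the question.
z = pd.Series([pd.NA, 5], dtype='Int32', name='Z')

# The approach from the answer: fill <NA> in the mask before NumPy sees it.
# The result is a NumPy object array, because it mixes pd.NA and integers.
via_numpy = np.where(pd.isna(z), pd.NA, np.where(z.gt(0).fillna(False), 1, 0))

# Alternative (an assumption, not from the answer): stay in pandas and cast
# the nullable boolean mask to Int32, so True -> 1, False -> 0, <NA> -> <NA>.
via_pandas = z.gt(0).astype('Int32')

print(via_numpy)
print(via_pandas)
```

The NumPy version loses the nullable dtype, so it may need converting back with astype('Int32') if the result is assigned to the column again.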

missing-data

by MR

Published November 24, 2021
