Replace specific values in a dataframe by column mean in pandas

July 7, 2022

I’m a Python beginner and I’m trying to do some operations with dataframes that I usually do in R.

I have a large dataframe with 2592 rows and 205 columns, and I want to replace the 0.0 values with half the minimum non-zero value of their column.

An example with a random dataframe would be:

>>> import pandas as pd
>>> import numpy as np
>>> np.random.seed(1)
>>> df = pd.DataFrame(np.random.randint(0,10, size=(3,5)), columns = ['A', 'B', 'C', 'D', 'E'])
>>> print(df)
   A  B  C  D  E
0  5  8  9  5  0
1  0  1  7  6  9
2  2  4  5  2  4

And the result I’m looking for is:

   A  B  C  D  E
0  5  8  9  5  2
1  1  1  7  6  9
2  2  4  5  2  4

Intuitively I would do it like this:

>>> for column in df:
        for element in column:
            if element == 0:
                element = df[column].min()/2

But it doesn’t work… any help?

Thank you!

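Solution:

Use DataFrame.mask. Replace the zeros with NaN so they are skipped when taking each column's minimum, divide that minimum by 2, and use the result to fill the positions where the original value is 0: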

df1 = df.mask(df.eq(0), df.replace(0, np.nan).min().div(2), axis=1)
print(df1)
   A  B  C  D  E
0  5  8  9  5  2
1  1  1  7  6  9
2  2  4  5  2  4
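
To make the one-liner easier to follow, here is a minimal sketch that splits it into named steps. The frame is typed in by hand to match the example above, and the variable names (is_zero, nonzero_min, half_min, filled) are only illustrative.

```python
import numpy as np
import pandas as pd

# Same values as the example frame in the question, entered by hand.
df = pd.DataFrame(
    [[5, 8, 9, 5, 0], [0, 1, 7, 6, 9], [2, 4, 5, 2, 4]],
    columns=["A", "B", "C", "D", "E"],
)

# Boolean frame: True wherever the original value is 0.
is_zero = df.eq(0)

# Turn zeros into NaN so that min() skips them, giving the
# smallest non-zero value of each column.
nonzero_min = df.replace(0, np.nan).min()

# Half of that minimum, as a Series indexed by column name.
half_min = nonzero_min.div(2)

# axis=1 lines the Series up with the column labels, so each zero
# is replaced by the value computed for its own column.
filled = df.mask(is_zero, half_min, axis=1)
print(filled)
```

Depending on the pandas version, the result may be printed with float columns rather than integers; the values are the same either way.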

A more efficient variant reuses the boolean mask instead of calling replace (thanks @mozway):

m = df.eq(0)
df1 = df.mask(m, df[~m].min().div(2), axis=1)
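
As for why the loop in the question has no effect: iterating over a DataFrame yields its column labels, so the inner loop walks over the characters of each label, and assigning to the loop variable only rebinds a local name without touching the frame. If an explicit loop is preferred, a rough sketch could go column by column and assign through .loc. The names below (result, zeros, nonzero_min) are illustrative, and the conversion to float is an assumption made so that halves such as 0.5 can be stored.

```python
import pandas as pd

# Same values as the example frame in the question, entered by hand.
df = pd.DataFrame(
    [[5, 8, 9, 5, 0], [0, 1, 7, 6, 9], [2, 4, 5, 2, 4]],
    columns=["A", "B", "C", "D", "E"],
)

# Work on a float copy so fractional replacements fit in every column.
result = df.astype(float)

for col in result.columns:
    # Boolean Series marking the zeros in this column.
    zeros = result[col] == 0
    # Smallest value among the non-zero entries of this column.
    nonzero_min = result.loc[~zeros, col].min()
    # Write back into the frame itself, not into a loop variable.
    result.loc[zeros, col] = nonzero_min / 2

print(result)
```

With 205 columns this loop runs only 205 times, but the vectorised mask call above is still the more idiomatic choice.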

byMR

Published July 07, 2022
