Pandas merging value of two rows in columns of a single row

by MR

February 5, 2023

I have data like this, which is the output of a groupby:

numUsers = df.groupby(["user","isvalid"]).count()

                      count     
user       isvalid               
5          0.0         1336  
           1.0          387

But I need separate count_valid and count_invalid columns for each user, like this:

                    count_valid  count_invalid
user 
5                           387           1336

How can I do this efficiently in Pandas?

Solution:

You can rename the values of the isvalid index level, then unstack that level into columns:

out = (df.groupby(["user","isvalid"]).count()
         .rename({0: 'count_invalid', 1: 'count_valid'}, level=1)
         ['count'].unstack()
       )

Output:

isvalid  count_invalid  count_valid
user                               
5                 1336          387
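
To try this on something reproducible, here is a minimal sketch on a small made-up frame (the sample rows and the second user, 7, are assumptions, not the asker's data). It also passes fill_value=0 so that a user with no valid rows gets 0 instead of NaN, and reorders the columns to match the layout requested in the question:

```python
import pandas as pd

# Hypothetical sample data; 'count' is just a non-null column for count() to tally.
df = pd.DataFrame({
    "user": [5, 5, 5, 7, 7],
    "isvalid": [0.0, 1.0, 1.0, 0.0, 0.0],
    "count": [1, 1, 1, 1, 1],
})

out = (
    df.groupby(["user", "isvalid"])["count"].count()
      .unstack(fill_value=0)  # isvalid values become columns; missing pairs -> 0
      .rename(columns={0.0: "count_invalid", 1.0: "count_valid"})
      [["count_valid", "count_invalid"]]  # column order from the question
)
print(out)
```

Selecting the 'count' column before calling count() yields a Series, so unstack returns a frame with a single level of columns and no flattening step is needed.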

Or, more generically, if you have multiple value columns, unstack first and then flatten the resulting column MultiIndex:

out = (df.groupby(["user","isvalid"]).count()
         .unstack().rename(columns={0: 'invalid', 1: 'valid'}, level=1)
       )
out.columns = out.columns.map('_'.join)

Output:

      count_invalid  count_valid
user                            
5              1336          387
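
The following sketch shows what the flattening does when there is more than one value column. The extra 'clicks' column and the sample rows are hypothetical, added only to illustrate the resulting names:

```python
import pandas as pd

# Hypothetical data with two value columns; 'clicks' has one missing value.
df = pd.DataFrame({
    "user": [5, 5, 5, 7],
    "isvalid": [0.0, 1.0, 1.0, 1.0],
    "count": [1, 1, 1, 1],
    "clicks": [3, 4, None, 2],
})

wide = (
    df.groupby(["user", "isvalid"]).count()   # non-null count per column
      .unstack(fill_value=0)                  # columns: (value column, isvalid)
      .rename(columns={0.0: "invalid", 1.0: "valid"}, level=1)
)
# Join each (value column, label) tuple into a single flat name.
wide.columns = wide.columns.map("_".join)
print(wide.columns.tolist())
```

Because count() ignores missing values, each value column can give different totals for the same user and validity flag.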

Or, directly from the original dataset, with a crosstab:

pd.crosstab(df['user'], df['isvalid'].map({0: 'count_invalid', 1: 'count_valid'}))
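
A self-contained version of the crosstab approach, again on made-up rows, might look like the sketch below. The reindex step is an addition of mine to fix the column order and to guarantee both columns exist even if one label never occurs. Note that crosstab counts rows, whereas groupby(...).count() counts non-null values per column, so the two can differ when the data contains missing values:

```python
import pandas as pd

# Hypothetical sample data; only the two key columns are needed here.
df = pd.DataFrame({
    "user": [5, 5, 5, 7, 7],
    "isvalid": [0.0, 1.0, 1.0, 0.0, 0.0],
})

labels = df["isvalid"].map({0.0: "count_invalid", 1.0: "count_valid"})
out = (
    pd.crosstab(df["user"], labels)
      .reindex(columns=["count_valid", "count_invalid"], fill_value=0)
)
out.columns.name = None  # drop the leftover 'isvalid' axis label
print(out)
```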
