Follow

Follow

Contact

Home Count Value Exclude Duplicated ID

Questions

Count Value Exclude Duplicated ID

byMR

November 1, 2022

I have dataframe

df1 = pd.DataFrame({'id': ['1','2','2','3','3','4','5'],
                    'event': ['Basket','Soccer','Soccer','Basket','Soccer','Basket','Soccer']})

I want to count unique values of event but exclude the repeated id. The result I expect are:

event   count   
Basket  3       
Soccer  3

>Solution :

This will work:

df1.groupby('event').agg({'id':lambda x: len(pd.unique(x))})

# OR

df1.groupby(['event']).agg(['nunique'])

Output:

dataframe

byMR

Published November 01, 2022

Add a comment

Leave a ReplyCancel reply

Read more

Questions

Python dataframe: how to remove first column (row number)

byMR

November 1, 2022

Questions

numeric calendar vector for dynamic objects

byMR

November 1, 2022

Questions

Warning Reactjs – undefined

byMR

November 1, 2022

Questions

GitHub Actions Changelog Generator Results in Error

byMR

November 1, 2022