How to get a count of values in a Pandas DataFrame column within groups?

by MR

December 16, 2021

I have a DataFrame with a structure like this:

import pandas as pd

df = pd.DataFrame({
    'id': ['123', '123', '123', '456', '456', '789'],
    'type': ['A', 'A', 'B', 'B', 'C', 'A']
})

id	type
123	A
123	A
123	B
456	B
456	C
789	A

How can I get a count of each type grouped by id, and create a new column for each unique type?

The resulting DataFrame I’m looking for would look like this:

df = pd.DataFrame({
    'id': ['123', '456', '789'],
    'A': [2, 0, 1],
    'B': [1, 1, 0],
    'C': [0, 1, 0]
})

id	A	B	C
123	2	1	0
456	0	1	1
789	1	0	0

Thank you for any help and guidance.

Solution:

You can group by both columns, count the rows in each (id, type) pair, and pivot the type level into columns:

out = df.groupby(['id','type']).size().unstack().fillna(0).astype(int).rename_axis([None])

Or, as @Quang Hoang suggested, more simply with pd.crosstab:

out = pd.crosstab(df['id'], df['type']).rename_axis([None])

Output:

type  A  B  C
123   2  1  0
456   0  1  1
789   1  0  0
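
Note that in the output above, id is still the index and the column axis still carries the name "type", whereas the frame asked for in the question has id as an ordinary column. The following is a minimal sketch of one way to close that gap, not the only way. The variable names counts, result and via_crosstab are illustrative. It also passes fill_value=0 to unstack, which keeps the counts as integers and so replaces the fillna(0).astype(int) step.

```python
import pandas as pd

df = pd.DataFrame({
    'id': ['123', '123', '123', '456', '456', '789'],
    'type': ['A', 'A', 'B', 'B', 'C', 'A'],
})

# Count rows per (id, type) pair, then pivot "type" into columns.
# fill_value=0 fills missing pairs directly, so the counts stay integers.
counts = df.groupby(['id', 'type']).size().unstack(fill_value=0)

# Drop the "type" label from the column axis and turn the "id" index
# back into a regular column.
result = counts.rename_axis(None, axis=1).reset_index()

# The same reshaping applied to the crosstab variant.
via_crosstab = (
    pd.crosstab(df['id'], df['type'])
    .rename_axis(None, axis=1)
    .reset_index()
)

print(result)
```

Both result and via_crosstab should then have the columns id, A, B and C, with one row per id, which is the layout requested in the question.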

pandas-groupby
